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1.   INTRODUCTION 

Rapid  development  in  Integrated  Circuit  Technology  together  with  the 
increasingly  widespread  use  of  computers  in  recent  years  has  brought  about 
many  significant  changes  in  computer  design  philosophy.   ICs  now  can  be  mass- 
produced  at  very  low  cost:   The  resulting  drop  in  the  relative  cost,  as  compared 
to  that  of  other  components.,  other  construction  and  operation  costs  of  a  com- 
puter, leads  to  a  new  set  of  criteria  in  determining  an  optimum  system  organi- 
zation and  the  design  of  the  detail  circuits.   These  new  criteria  inevitably 
emphasize  on  extensive  use  of  mass-producible  items  like  the  LSI  and  avoid  as 
much  as  possible  the  components  which  are  not  mass-producible  and  construction 
processes  which  are  difficult  to  automate.   On  the  other  hand,  extensive  use 
of  computer  by  people  from  all  walks  of  life  calls  for  new  types  of  computers 
with  more  flexible  system  organization,  higher  reliability  and  simpler  opera- 
tion procedures. 

Under  these  new  conditions,  a  novel  stochastic  computer  with  highly 
flexible  structure,  originally  suggested  by  Professor  Poppelbaum,  is  being 
built.   It  takes  advantage  of  the  latest  advances  in  component  technology, 
and  leads  to  new  standards  of  computer  structure  and  performance.   This  com- 
puter is  dubbed  APE  machine    as  its  basic  building  blocks  are  a  set  of 
Autonomous  Processing  KLements.   These  basic  building  blocks  consist  of 
absolutely  identical  circuits  (except  for  transmitter  frequencies),  making 
the  structure  of  the  APE  machine  highly  homogeneous.   This  homogeneous  struc- 
ture lends  itself  exceedingly  well  to  mass-production  by  integrated  circuit 
technology. 


Each  basic  building  block  is  a  small  processor  in  its  own  right. 
However  they  can  be  grouped  together  to  form  a  more  powerful  system,  Without 
any  actual  physical  hook-ups.   The  freedom  and  ease  of  incrementing  its  com- 
puting power  by  simply  adding  more  APEs  makes  the  APE  machine  a  futuristic 
design  in  keeping  with  the  changing  needs  of  computer  users:   At  the  present 
time,  an  increase  in  the  computing  power  of  a  computer  system  usually  requires 
the  replacement  of  the  system  or  a  major  processing  unit.   (For  a  small  but 
increasing  number  of  recent  models,  provision  is  made  to  allow  additional  pro- 
cessors to  be  connected  to  the  existing  facilities  to  boost  the  computing 
capability.   But  in  all  cases,  the  user  has  no  control  over  the  size  of  the 
incremental  power  and  has  little  influence  in  organizing  the  computer  system 
in  a  problem-dependent  way  in  order  to  obtain  better  performances.)   For  the 
APE  machine,  a  bigger  computing  capability  simply  means  using  more  basic  build- 
ing blocks,  and  these  building  blocks  can  be  organized  in  various  ways  to  suit 
particular  applications! 

Fault  tolerance  is  a  highly  desirable  feature  for  complicated  machines 

such  as  computers.   It  is  even  more  so  for  computers  handling  critical  jobs. 

(2)(3) 

Research  in  this  area       has  grown  rapidly  over  the  past  decade.   It  is 

quite  clear  at  this  point  that  fault  tolerance  is  going  to  be  one  of  the  most 
important  features  of  future  computers.   Should  the  need  arise,  additional 
circuitry  could  be  easily  added  to  the  APE  machine  such  that  various  fault 
tolerant  features  (such  as  static  self-checking,  dynamic  self-checking,  and 
even  self-repair)  may  be  incorporated. 

The  processing  method  of  the  APE  machine  is  based  on  the  stochastic 
processing  principles  to  be  discussed  in  Chapter  3.   Stochastic  processing  is 


used  because  the  project  APE  machine  represents  an  effort  to  explore  the  new 
approaches  to  data  processing  and  to  develop  new  processing  techniques  and  new 
hardware.   Stochastic  processing  has  been  investigated  both  in  the  U.S.  and 
abroad   '*-?'*       ',   Research  conducted  in  the  Hardware  and  Systems  Research 
Group  under  Professor  Poppelbaum  has  made  significant  contribution  to  the 
theory  and  practice  in  this  area.   Several  complex  computing  machines  based 
on  stochastic  computation  principles  have  been  successfully  constructed  over 
the  past  few  years  by  this  research  group  .   The  APE  machine  is  yet 

another  attempt  to  further  investigate  this  area. 

In  the  following  chapters,  the  system  structure  and  the  design  of 
the  APE  machine  will  be  discussed.   Some  detail  analysis  is  given  to  its  sto- 
chastic operations.   Its  limitations  as  well,  as  their  possible  solutions  are 
also  presented. 


2.   SYSTEM  DESCRIPTION  OF  THE  APE  MACHINE 

2.1  General  Description 

The  APE  machine  is  an  on-line  real-time  stochastic  computer  with 
reconfigurable  structure.   It  consists  of  a  set  of  Autonomous  Processing 
Elements  known  as  APEs,  a  set  of  sensors  to  acquire  raw  input  data  for  pro- 
cessing, a  remote  power  supply  and  a  program  control  unit.   The  system  diagram 
of  the  APE  machine  is  depicted  in  Figure  2.1.   Data  communication  "between 
various  constituents  of  the  APE  machine  is  carried  out  through  radio  frequency 
channels  only.   Power  is  also  transmitted  remotely  into  the  APEs  in  the  form 
of  light  energy.   Therefore  the  APE  machine  does  not  require  assembly  into  a 
physically  wired  network.   For  communication  with  other  parts  of  the  computer 
each  APE  has  two  tunable  data  input  channels,  a  fixed  instruction  channel,  and 
an  output  transmitting  channel  operated  at  the  same  fixed  frequency  but  on 
a  time-multiplexed  basis  with  the  instruction  channel.   The  two  data  input 
channels  can  be  tuned  to  establish  RF  linkages  with  the  output  of  any  other 
APEs  or  sensors.   The  APE  can  perform  on  its  two  input  data  any  of  the  opera- 
tions of  addition,  subtraction,  multiplication,  division,  differentiation, 
integration  and  storage.   The  tuning  of  the  data  input  channels  as  well  as 
the  setting  for  an  APE  to  perform  a  specific  operation  is  done  remotely  through 
the  instruction  channel  linking  with  the  control  unit.   The  sensors  provide 
the  system  interface  with  the  outside  world:   They  convert  the  input  variables, 
such  as  temperature,  light  intensity,  etc.,  into  machine  variables  compatible 
with  the  data  input  channels  of  the  APEs. 

In  Figure  2.1,  the  APEs  are  symbolized  by  triangles  with  i  and  j 
denoting  the  tunable  data  input  channels  and  k  denoting  the  fixed  output 
channel.   A  specific  output  channel  k  is  identified  by  its  channel  frequency 
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v    There  are  N  number  of  different  output  channels  available  to  the  system. 
As  mentioned  before  the  computational  power  of  the  APE  machine  is  incrementable, 
However,  the  increment  is  not  without  bound.   The  number  N  is  a  parameter 
indicating  the  maximum  computational  power  which  a  particular  APE  machine  can 
obtain.   It  should  be  noted  that  the  number  of  APEs  in  a  specific  APE  machine 
might  be  greater  or  smaller  than  N.   In  other  words,  it  is  allowed  to  have 
redundant  APEs  operating  in  parallel  or  completely  missing  APEs  for  any  parti- 
cular channel.   This  will  be  discussed  in  more  detail  later  on.   A  specific 
operation  instruction  for  an  APE  is  denoted  by  6 .   It  could  be  any  one  of  the 
seven  operations  mentioned  above.   The  sensors  are  denoted  by  a  circle  in 
Figure  2.1.   Their  outputs  are  transmitted  through  channels  other  than  those 
for  the  APEs.   Their  channel  frequencies  are  denoted  by  v„+, . 

A  simple  example  of  organizing  the  APEs  and  sensors  to  perform 

2 
a*b+c   is  illustrated  in  Figure  2.2.   In  this  example,  three  input  variables 

a,  b  and  c  are  provided  by  sensors  S  ,  S_  and  S_.   Their  values  are  encoded 

and  transmitted  from  their  respective  output  channels.   The  input  channels  i 

and  j  of  the  APEs  belonging  to  the  channels  denoted  by  1  and  m  are  tuned 

to  receive  input  data  from  S  ,  Sp  and  S_  as  shown.   APEs  in  channels  1  and 

m  are  set  to  perform  multiplication.   The  APE  or  APEs  operating  in  the  channel 

denoted  by  n,  with  their  data  input  channels  tuned  to  receive  input  data  from 

the  output  of  APE  channel  1  and  APE  channel  m,  are  set  to  perform  addition. 

2 
The  final  result  a*b  +  c   is  then  transmitted  at  frequency  v  .  The  program  con- 
trol unit  is  equipped  with  an  all-channel  receiver.   It  can  be  set  to  receive 

2 
the  final  result  a'b  +  c   from  APE  channel  n  as  well  as  outputs  from  any 


7 

other  APE  channels  and  sensor  channels.   The  display  of  the  result  is  done  by 
means  of  a  Nixie-tube  display  panel.   For  more  permanent  outputs,  provision 
is  made  to  interface  with  a  teletype  for  printing  the  outputs. 

Programming  of  the  APE  machine  can  be  carried  out  by  selecting  two 
APE  channels  i  and  j  for  the  two  tunable  data  input  receivers  and  assigning  a 
specific  operation  0  for  each  of  the  APEs  involved,  as  symbolized  by  the  i, 
j  and  k  dials  together  with  a  row  of  buttons  for  different  operations  in 
Figure  2.1.   The  clock  is  used  for  synchronization  while  the  'Alive'  and  'Set' 
buttons  indicate  part  of  the   fault  tolerant  features  and  are  used  to  test 
an  APE  channel  before  it  is  put  into  operation.   More  detail  about  synchroni- 
zation and  testing  will  be  discussed  later.   The  topology  table  of  the  control 
unit  shown  in  the  figure  simply  serves  symbolically  as  a  convenient  place  to 
keep  a  record  of  the  program  of  the  APE  machine. 

The  APE  machine  has  three  different  modes  of  operation.   In  the  test 
mode,  test  signals  are  sent  out  to  check  all  APE  channels  involved  in  the  pro- 
gram to  be  executed.   In  the  programming  mode,  the  APEs  and  sensors  are 
organized  in  a  suitable  structure  according  to  the  program.   During  the  execu- 
tion mode,  data  transmission  and  data  processing  are  done  by  the  APEs  in  a 
synchronous  manner.   All  APEs  communicate  with  each  other  during  the  communica- 
tion period  of  the  computing  cycle  and  process  the  data  during  the  remaining 
part  of  the  computing  cycle. 

2.2  The  Modulation  Scheme  Employed  by  the  APE  Machine  for  Data  Communication 
Special  consideration  is  required  in  choosing  the  proper  modulation 
|  scheme  so  that  more  than  one  APE  having  identical  output  channel  frequency 
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are  allowed.   In  the  case  of  several  APEs  having  the  same  output  channel,  they 
are  said  to  he  of  the  same  type.   All  APEs  of  the  same  type  operate  in  parallel 
in  every  respect.   Their  corresponding  input  data  channels  are  tuned  to 
receive  data  from  the  same  source,  and  they  are  set  to  perform  the  same  opera- 
tion.  In  other  words,  the  set  of  APEs  actually  consists  of  many  different 
types  of  APEs.   A  specific  type  of  APE  has  the  same  specific  fixed  frequency 
channel  for  instruction  input  and  its  data  output.   Whenever  an  instruction 
is  sent  out  from  the  program  control  unit,  all  APEs  of  the  same  type  will 
be  programmed  exactly  the  same  way.   To  guarantee  a  proper  reception  of  data 
from  the  same  type  of  APEs  by  the  APEs  in  the  following  stage,  a  synchronous 
pulse  width  modulation  scheme  is  employed  for  data  communication.   All  APEs 
and  sensors  are  synchronized  for  data  transmission  and  reception:   At  exactly 
the  same  time,  each  of  the  APEs  and  sensors  begins  to  transmit  an  RF  pulse 
whose  width  carries  the  information.   If  there  are  several  APEs  in  one  parti- 
cular channel,  they  stop  transmission  at  about  the  same  time  due  to  the  fact 
that  the  data  sent  out  from  each  of  them  are  equal  (to  within  the  accuracy 
of  the  processor  of  the  APEs).   With  this  kind  of  modulation  scheme,  it  is 
evident  that  the  data  transmission  is  not  seriously  degraded  by  the  existence 
of  redundant  APEs. 

2-3  Flexibility  in  Forming  the  Get  of  APEs  and  Fault  Tolerant  Capability  of 
the  APE  "achine 

As   indicated  in  Figure  2.1,  the  program  control  unit  can  send  out  a 
test  signal  to  every  APE  channel.   Depending  on  the  complexity  of  the  test,  the 
operator  could  find  out  from  the  response  how  well  the  APE  or  APEs  of  that 
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channel  is  functioning  before  he  tries  to  put  that  type  of  APEs  into  operation, 
Only  a  simple  existence  test  for  an  APE  in  a  specific  channel  is  implemented 
in  our  machine:   A  negative  response  from  this  test  would  indicate  either  that 
there  is  no  APE  in  the  set  belonging  to  this  type  or  that  none  of  the  APEs 
of  this  channel  are  functioning  properly.   Then  this  channel  is  simply  not  to 
be  put  into  operation  . . . 

The  freedom  to  have  redundant  APEs  of  the  same  type  operating  in 
parallel  as  well  as  to  have  no  APE  or  no  properly  functioning  APE  in  a  parti- 
cular channel  leads  directly  to  some  important  features  of  the  APE  machine. 
First,  the  set  of  APEs  can  be  grouped  together  untested  and  in  a  random 
fashion.   In  the  imminent  LSI  era,  an  APE  being  build  as  a  single  LSI  chip 
becomes  a  realistic  possibility.   The  fact  that  testing  is  not  needed  helps 
to  simplify  the  production  of  these  LSI  chips.   Ultimately,  it  is  quite  con- 
ceivable that  the  APEs  could  be  mass  produced  at  low  cost:   Whenever  a  set 
of  APEs  is  needed  to  set  up  an  APE  machine,  they  could  just  be  picked  up  at 
random  from  a  large  stock  of  untested  LSI  chips.   If  too  many  of  them  are 
redundant  or  perhaps  do  not  function,  the  operator  simply  throws  more  LSI 
chips  into  the  set  of  the  APEs  until  he  has  sufficient  number  of  different 
types  of  APEs  (i.e.  different  frequencies)  at  his  disposal  for  the  job. 
Secondly,  the  redundant  APEs  increase  the  reliability  of  the  APE  machine.   Of 
course  there  are  many  possible  ways  a  malfunction  could  occur.   All  those  mal- 
functions which  result  in  sending  shorter  output  pulses  will  have  no  signifi- 
cant effect  upon  the  APE  machine  if  there  is  at  least  one  APE  in  each  type 
functioning  properly. 
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2.k      The  Structure  of  an  APE 

A  functional  diagram  of  an  APE  is  given  in  Figure  2.3.  It  has  a 
instruction  receiver  tuned  permanently  to  channel  v   for  the  k  type  APE 
elements.   During  the  programming  mode  the  transmitter  of  every  APE  is  bein^ 
shut  off,  and  the  instruction  receiver  is  activated  to  receive  programming 
instructions  from  the  APE  control  unit.   The  program  instruction  carries 
the  information  of  how  to  tune  the  data  receivers  A  and  B  as  well  as  what 
specific  operation  to  perform.   This  information  is  stored  and  decoded  by 
the  decoder  for  function.   After  completing  the  programming  of  all  APEs 
involved  in  a  specific  program,  the  APE  machine  is  switched  to  the  execution 
mode.   The  two  tunable  data  receivers  A  and  B  are  now  tuned  according  to  the 
instruction.   The  data  which  they  receive  are  in  duty  cycle  modulation.   They 
are  converted  into  binary  numbers  and  stored  by  two  duty  cycle  decoders  as 
shown  in  the  figure.   These  binary  numbers  x  and  y  are  now  fed  into  the 
stochastic  computing  element  for  a  specific  type  of  processing  according  to 
the  value  of  6  from  the  decoder  for  function  .   The  processing  result  z  is 
in  binary  and  is  again  encoded  into  duty  cycle  representation  before  it  is 
transmitted.   The  clock  receiver  provides  the  synchronization  for  the  trans- 
mission of  the  output  data,  and  it  also  carries  the  mode  control  signal.   The 
'alive'  reply  signal  control  is  incorporated  to  answer  a  test  signal  from 
the  control  unit.   The  answer  is  also  sent  out  through  the  output  transmitter. 
The  details  of  structure  and  the  operation  of  the  APE  will  be  discussed  in 
Chapter  5. 

2.5   Power  Supply  of  the  APE 

The  APEs  are  powered  by  four  solar  cells  illuminated  by  incandescent 
lamps.   The  details  on  this  part  of  the  project  is  given  in  Chpater  7. 
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STOCHASTIC  DATA  PROCESSING  METHODS  OF  THE  APE  MACHINE  AND  THEIR 
STOCHASTIC  PROPERTIES 


3.1  Number  Representation  in  Stochastic  Data  Processing 

During  the  past  decade,  considerable  interest  has  arisen  in  a  new 
way  of  representing  numbers  on  electronic  computers.   Unlike  the  common  met  nods 
of  representing  a  number  by  a  continuous  electrical  quantity  or  by  a  temporal 
sequence  of  discrete  electrical  quantity,  the  new  method  represents  a  number 
by  the  statistical  average  of  a  random  variable  in  a  discrete  stochastic  pro- 
cess.  Although  this  is  not  a  very  efficient  way  to  represent  numbers  from  the 
information  encoding  point  of  view,  extremely  simple  digital  hardware  is  suf- 
ficient to  process  data  in  this  representation.   The  most  striking  advantage 
of  this  representation  of  numbers  lies  in  the  fact  that  any  Boolean  operation 
on  binary  variables  corresponds  to  an  arithmetic  operation  on  the  numbers 
corresponding  to  a  sequence  of  these  binary  variables.   Therefore  a  simple  logic 
gate  can  perform  a  complicated  arithmetic  operation  which  would  require  scores 
of  gates  to  perform  if  the  numbers  were  in  common  representation.   Several 
different  types  of  stochastic  number  representations       have  been  developed. 
The  type  being  employed  for  the  APE  machine  is  called  synchronous  random 
pulse  sequence  or  simple  SRPS.   In  this  case,  a  number  is  represented  by 
the  probability  of  occurrence  of  a  pulse  in  any  time  slot  of  a  clocked  time 
pulse  sequence.   An  example  of  a  SRPS  is  depicted  in  Figure  3.1.   In  terms  of 
mathematical  language,  a  SRPS  is  a  discrete  stochastic  process  denoted  by  a 

sequence  of  statistically  independent  random  variables  X  where  X  is  a 

n        n 

Boolean  variable.   For  a  specific  n,  it  can  be  either  '1'  or  '0'.   The  sub- 
script n  covers  the  entire  positive  integer.   The  m   order  joint  distribution 

function  of  X  ,  is  given  by  P(X  ,X0,  ...  X  ).   Because  X.  and  X.  with  1  4   J 
n  12m  i     j 
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are  independent  random  variables,  it  follows  that 


m 

p(x  ,  x2,  ...,  xm)  =    n  p(xn)  3.1 

1  n=l 

To  map  a  number  into  a  SRPS ,  one  must  first  of  all  normalize  the 

number  and  express  it  as  a  fraction  of  the  full  range.   Then  a  SRPS  is  to  be 
produced  such  that  the  probability  of  occurrence  of  a  pulse  in  any  clock 
period  equals  that  fraction.   In  general,  the  SRPS  is  a  non-stationary  and 
therefore  non-ergodic  process  because  the  number  which  it  represents  is  gen- 
erally a  function  of  time.   However,  if  the  time  average  of  a  SRPS  is  computed 
over  a  period  in  which  the  number  it  represents  is  held  fixed,  as  it  is 
precisely  the  case  for  the  APE,  a  SRPS  can  be  treated  as  an  ergodic  process. 
It  follows  that  the  time  average  of  any  sample  function  of  a  SRPS  is  identical 
to  the  statistical  average.   This  property  allows  the  conversion  of  a  number 
from  stochastic  representation  back  to  the  common  analog  representation 
s imply  by  integration  over  a  sampling  time  period. 

3.2  Some  Simple  Boolean  Operations  on  SRPS 

Suppose  two  SRPS's  X  and  Y   represent  two  numbers  x  and  y  respectively. 

If  an  AND  gate  is  fed  by  these  two  SRPS's,  the  output  is  visibly  a  new  SRPS 

with  the  probability  of  having  a  pulse  in  any  time  slot  given  by  P(X  =1,  Y  =l). 

Let  the  resulting  SRPS  be  denoted  by  Z  and  the  number  it  represents  by  z.   Then 

the  statistical  mean  of  Z   is  given  by 

n 

Exp  {Z  }  =  P(Z  =1)  =  P(X  =1,  Y  =1)  3.2 

r     n       n         n     n 

If  X  and  Y  are  statistically  independent,  Equation  3.2  becomes 

Exp{Z  }  =  P(X  =1)  •  P(Y  =1)  3.3 

*     II       n         n 
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Recalling  that 

Exp^Z   }   =  P(Z  =1)   =   z  3.U 

c 

Exp^X  >   =  P(X  =1)   =  x  3.5 

Exp^Y  >   =  P(Y  =1)   =  y  3.6 

n  n 

Equation  3.3   "becomes 

z  =  x*y  3.7 

This  means  that  a  simple  AND  gate  is  what  is  needed  to  perform 

multiplication  for  numbers  in  SRPSs.   Similarly,  it  can  be  shown  that  two 

numbers  in  SRPSs  X  and  Y  are  mutually  exclusive,  their  Boolean  OR  operation 

n      n 

corresponds  to  the  addition  operation. 

3.3  Representation  of  Negative  Numbers  by  SRPS 

Since  the  range  of  the  value  of  probability  is  from  zero  to  one, 
it  becomes  necessary  to  have  a  suitable  transformation  before  a  negative 
number  can  be  represented  by  a  SRPS.   Without  loss  of  generality,  one  only 
needs  to  examine  the  transformation  of  a  variable  in  a  closed  region 
i-l,  +l)  into  another  variable  in  the  closed  region  (09   l) .   Variable  values 
outside  the  range  of  (  -1,  +l)  can  always  be  normalized  to  within  this  range. 
To  avoid  confusion,  it  is  helpful  to  keep  in  mind  the  distinction  between 
three  different  notations  all  representing  the  same  quantity  but  in  different 
representation 

X  :   external  variable  with  range  (  -1,  +l) 

x:   machine  variable  with  range  {0,  1} 
X  :   SRPS  representing  x 


given  by 


IT 


For  an  APE,  the  transformation  between  X  and  x  is  linear  and  is 
'  e 


X  =  1  -  2x  3.8 

e 


With  this  type  of  transformation,  it  can  be  shown  that  the  SRPS 
representing  x  can  be  easily  converted  to  one  representing  -x  by  passing  it 
through  an  inverter. 

3. k     Addition  and  Subtraction  Operations  in  Terms  of  Machine  Variables 

Addition  in  terms  of  external  variable  is  given  by 

Z  =  X  +  Y  3.9 

e    e    e 

with  X  and  Y  being  the  input  data  and  Z  being  the  sum.   The  summation  in 

terms  of  machine  variables  can  be  obtained  readily.   Let  x,  y,  and  z  be  the 

machine  variables  corresponding  to  Z  ,  X  ,  and  Y  respectively.   According 

to  Equation  3.8 

Z  =  1  -  2z 
e 


X  =  1  -  2x  3.10 

e 

Ye  =  1  -  2y 

Substituting  Equation  3.10  into  Equation  3.9  we  get 

z  =  x  +  y  -  1/2  3.11 

Similarly  for  subtraction,  given  by  Z  =  X  -  Y  ,  the  operation 
in  terms  of  machine  variables  can  be  expressed  by 

z  =  x  -  y  +  1/2  3.12 

The  hardware  circuit  for  implementing  the  addition  and  subtraction 
operations  is  shown  in  Figure  3.2  The  processor  itself  consists  of  only  two 
AND  gates,  one  OR  gate  and  an  inverter.   The  variables  to  be  processed,  namely 
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X  and  Y   are  in  SRPS  representation.   They  are  gated  alternatively  through  the 
n      n 

AND  gates  by  the  complementary  gating  clock  signals.   This  insures  that  the  two 
operand  SRPSs  are  mutually  exclusive.   The  gating  clock  C   has  a  duty  cycle 
of  precisely  50%.   The  output  from  the  OR  gate  would  therefore  consist  of  half 
of  each  input  pulse  sequence  of  X  and  Y  .   This  mixed  sequence  is  subsequently 
gated  into  the  second  least  significant  bit  of  the  output  counter.   The  func- 
tion of  the  counter  is  to  convert  its  SRPS  input  into  the  binary  number.   Tnis 
is  done  by  counting  the  number  of  pulses  in  the  SRPS  over  as  many  clock  per- 
iods as  the  maximum  counts  of  the  counter.   The  final  count  would  therefore 
be  an  estimation  of  the  time  average  of  the  input  SRPS.   In  other  words,  the 
counter  converts  the  value  of  a  variable  in  SRPS  representation  into  binary 
representation.   When  the  input  is  fed  into  the  2nd  bit  of  the  counter,  it 
amounts  to  multiplying  the  input  by  a  factor  of  two.   Because  only  half  of  each 
input  SRPSs  X  and  Y  has  been  gated  into  the  counter,  the  multiplying  factor 
of  two  would  make  the  final  count  correspond  to  the  summation  of  the  two 
inputs  X  and  Y  .   At  the  beginning  of  each  integrating  period,  the  output 
counter  is  preset  to  half  full.   Since  the  counter  is  operated  in  a  modulo 
fashion,  this  means  subtracting  1/2  from  the  result.   Therefore  the  final  count 
of  the  output  counter  indicates  the  estimate  of  x  +  y  -  1/2,  which  is  the 
summation  of  the  two  input  variables  in  terms  of  machine  variables  as  expressed 
in  Equation  3.11. 

As  mentioned  earlier,  inverting  a  SRPS  corresponds  to  inverting 
the  sign  of  the  number  it  represents.   Therefore,  subtraction  merely  amounts 
1  i  inverting  the  subtrahend  before  doing  the  addition,  as  indicated  in 
Pigure  3.2. 
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3.5  Operation  of  Multiplication  in  Terms  of  Machine  Variables 

As  in  the  case  of  addition  or  subtraction,  multiplication  in  terms  of 

machine  variable  also  changes  its  form  under  the  transforation  of  Equation  3.8, 

In  terms  of  external  variables,  multiplication  is  given  by 

Z  =  X  -Y  3.13 

e    e  e 


where  X   Y  and  Z  are  the  multiplicand,  multiplier  and  the  product  respec- 
e'   e      e 

tively.   Upon  transforming  these  variables  into  machine  variables,  the  multi- 
plication operation  is  then  given  by 

z  =  x  +  y  -2x*y  3.1*+ 

The  hardware  required  to  implement  Equation  3.1*+  turns  out  to  be  a 
simple  EXCLUSIVE  OR  gate  as  shown  in  Figure  3.3. 

3.6  Division  in  Terms  of  Machine  Variables 

Like  the  other  operations  discussed  above,  the  division  operation 

in  terms  of  external  variables  is  different  from  the  one  in  terms  of  machine 

variables.   Let  Z  ,  Y  and  X  be  the  quotient,  numerator  and  denominator 
e    e      e 

respectively.   The  relation  of  their  corresponding  machine  variables  can  be 
obtained  by  applying  Equation  3.8  to  the  expression  of  division  operation. 
The  result  is 

— H-i 

For  the  implementation  of  this  expression,  somewhat  more  circuitry 
is  involved.   To  simply  the  hardware  design,  Equation  3.15  has  to  be  rewritten 
to  match  the  hardware  characteristics.   Since  z  is  necessarily  non-negative, 
z  can  therefore  also  be  expressed  as 
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3.16 


.  1  -  2x 

In  absolute  value,  the  numerator  and  the  denominator  can  be  computed 
more  easily.   As  mentioned  in  Chapter  2,  the  machine  variables  are  transmitted 
and  received  in  duty  cycle  modulated  form,  i.e.  the  width  of  the  transmitted 
pulse  carries  the  information  about  the  value  of  the  machine  variable.   Happily 
enough,  a  simple  EXCLUSIVE  OR  gate  is  all  it  needs  to  compute  the  absolute 
value  of  the  difference  of  two  machine  variables  in  duty  cycle  modulated  form, 
as  depicted  in  Figure  3.H.   To  compute  the  denominator  |  1  -  2x|  ,  the  first 
step  is  to  feed  the  input  decoding  counter  of  the  duty  cycle  decoder  (which 
will  be  discussed  in  more  detail  later  on)   by  the  2nd  least  significant 
bit  instead  of  the  least  significant  bit.   This  corresponds  to  entering 
2*x  into  the  input  decoding  counter.   This  counter  originally  is  of  10  bits 
length  to  match  the  range  of  the  numbers  it  handles.   In  order  to  compute 
the  denominator  of  Equation  3.l6,  one  more  bit  is  added  to  the  most  signifi- 
cant position.   This  extra  bit  is  referred  to  as  the  control  bit.   This  con- 
trol bit  is  used  to  control  a  TRUE/ COMPLEMENT  gate  as  shown  in  Figure  3.5 
The  remaining  10  bits  of  the  decoding  counter  represents  the  data  in  binary 
fraction.   If  x  is  less  than  one  half,  or  2*x  <1,  the  control  bit  is  a  '0'. 
The  TRUE/ COMPLEMENT  gate  would  gate  the  complement  of  the  10-bit  binary  frac- 
tion, i.e.,  1  -  2x,  to  its  output.   On  the  other  hand,  if  2x  >  1  the  control 
bit  is  '1»  and  the  content  of  the  10  data  bits  equals  to  2x  -  1.   These  10 
bits  are  directly  gated  to  the  output  of  the  TRUE /COMPLEMENT  gate.   Combining 
these  two  cases,  the  output  of  the  TRUE/ COMPLEMENT  gate  is  given  by  |1  -  2x|  . 

After  obtaining  the  numerator  |x  -  yj  and  the  denominator  |l  -  2x| , 
it  is  now  necessary  to  find  the  quotient.   In  order  to  explain  the  operation 
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clearly,  however,  it  would  be  "best  to  examine  the  following  case  first.   Suppose 

a  number  a  is  re-presented  by  a  SRPS  A  .   Let  this  SRPS  A  be  gated  into  a  counter 
*  v         n  n 

and  let  the  output  reading  of  different  bits  of  the  counter  represent  a  binary 
fraction.   Then  the  time  it  takes  for  this  counter  to  accumulate  enough  counts 
to  make  the  output  reading  equal  to  a  given  binary  fraction  n  is  a  random  vari- 
able with  mean  value  n/a  .   This  relation  can  be  applied  to  perform  the  division 
operation.   To  do  so,  let  the  binary  fraction  n  be  equal  to  |x  -  y|  and  let 
the  number  a,  which  the  SRPS  A  represents,  equal  |l  -  2x| .   Then  the  quotient 
| x  —  y | / | 1  —  2x |  is  obtained  by  counting  the  SRPS  representing  |l  -  2x|  until 
the  counter  reading  the  binary  fraction  is  equal  to  |x  -  y| .   The  time  it  takes 
to  accomplish  this  is  then  the  quotient,  already  in  duty  cycle  modulation  form. 
The  hardware  required  for  the  division  operation  is  actually  not  nearly  as 
complicated  as  it  appears  in  Figure  3.5-   This  is  because  the  decoding,  encoding 
and  storage  circuits  are  also  included  in  the  figure. 

3.7   Statistical  Properties  of  Stochastic  Processing  Operations  of  the  APE 
In  this  section,  a  statistical  analysis  is  given  of  l)  the  SRPS, 

2)  the  result  of  arithmetic  operations  on  numbers  represented  by  SRPSs,  and 

3)  the  result  obtained  by  cascading  several  APEs  together. 
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3.J.1  Statistical  Properties  of  an  SRPS 

As  mentioned  before,  an  SRPS  corresponds  in  general  to  a  non- 
stationary  stochastic  process.   However,  it  can  "be  treated  as  ergodic  process 
if  the  number  it  represents  is  held  fixed  for  each  computing  cycle.   Since 
this  is  the  case  for  the  APE  machine,  the  SRPS  is  treated  as  ergodic  through- 
out the  remainder  of  this  paper. 

3.7.I.I  Statistical  Parameters  and  Sampling  Statistics  of  an  SRPS 

An  SRPS  X  representing  a  number  x  has  a  distribution  function 
n 

P(X.=l)  =  x  for  all  i.   If  an  SRPS  X  is  given,  it  is  impossible  to  find  out 
exactly  what  the  number  x  is  in  a  finite  period  of  time.   The  best  that  can 
be  done  in  a  finite  period  of  time  is  to  find  an  estimate  of  x.   The  estimate 
is  obtained  by  integrating  or  (for  our  case)  counting  over  a  finite  number 

K  of  clock  periods.   The  result  is  called  sampling  mean  and  is  given  by 

K  X. 
(X  )   =  Z     _i  3.16a 

n  K   1=1  K 

This  sampling  mean  is  also  a  random  variable  and  has  its  mean  y/   \ 

UnjK 
and  standard  deviation  oty-    \        The  objects  of  the  section  are  to  derive  these 

■nV 

parameters. 

Let  the  probability  of  having  L  number  of  l's  in  K  number  of  clock 

periods  for  an  SRPS  X  be  P(K,  L).   Since  the  SRPS  X  is  a  sequence  of  indenen- 

n  n  * 

dent  random  variables  with  a  distribution  function  P(X.=l)  =  x,  P(K,  L)  is 
binomially  distributed  and  is  given  by 
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K!    L,    sK-L 

p(K>L)=LKirrT!x  (1"x)  3a7 

By  comparing  their  definitions,  it  is  not  difficult  to  find  that 

P(K,  L)  is  actually  the  distribution  function  for  (X  )   with  (X  )   given 

n  a.        n  K 

by  L/K.   Therefore  one  can  write 

with  (X  )v   =  L/K 
n  K. 

Since  (X  )   is  binomially  distributed,  its  mean  and  standard  deviation  are 


well  known  and  given  by 


^(Xn)K  =  x  3.19 


(X  L  =  ^-^  3.20 


n'K   :        k- 
V   K 

3.T-1.2  The  Relation  between  the  Confidence  Level,  Error,  and  the 
Integration  Time 

The  most  general  approach  to  this  problem  uses  the  Tchebychef f ' s 

inequality  which  states  that  for  a  random  variable  X  having  an  arbitrary 

distribution  with  mean  u  and  standard  deviation  o ,  the  following  inequality 

holds  for  every  e   >    0 

2 
P(|x  -  y|  >  e)<  ~  3*21 

e 

The  confidence  level  for  the  random  variable  X  lying  within  the 

maximum  allowable  error  c  from  its  mean  p  is  given  by 

a  ■  P(|x  -  y| <  e)  3.22 

Because   of  the   fact   that 

P(|x  -  -n|  <    e)  +  P(|x  -  y|   >  e)  ■  1  3.23 

confidence  level  a  can  also  be  expressed  as 

a   ■  1  -  P(|x  -  y|  >  e)  3.2U 


28 
Now  if  the  random  variable  is  (X^,  Equation  3.21  becomes  . 

p(|(xn)K-H{    )K|  >.>*!%*  3.2^ 

By  combining  Equations  3.2H,  3.23  and  3.20  one  can  obtain  the  relation  between 
confidence  level  a ,  maximum  allowable  error  e ,  and  the  minimum  number  of  clock 
periods  K  over  which  the  integration  must  be  taken: 


x 


(1-x) 


K>  —0 3.25 

e2(l-^) 

However,  recalling  that  0  <  x   <  1, 

x(l  -  x)  <  1/k  3.26 

By  substituting  Equation  3.26   into  Equation  3.25,    one   gets 

1 


K  > 


i     2d      \  3.27 

4e    ll-aj 


For  example,  in  order  to  find  out  the  time  average  of  a  SRPS  by  taking  the 

k  ai 

average  over  5  x  10  clock  periods,  one  will  have  at  least  95%  confidence  in 

the  reading  being  within  1%   of  the  exactly  correct  result.   The  fact  that  at 

least  95%  confidence  instead  of  simple  95%  confidence  is  due  to  the  'less  than 

or  equal  to'  relation  in  the  Tchebycheff ' s  inequality. 

Equation  3.27  is  derived  from  Tchebycheff ' s  inequality,  which  is 

applicable  to  random  variable  with  any  distribution  function.   Consequently, 

Equation  3.27  represents  a  requirement  somewhat  unnecessarily  restrictive  for 

the  binomially  distributed  (X  )  . 

n  K 

A  less  general  approach,  which  leads  to  a  weaker  requirement  than 

that  of  Equation  3.27,  takes  into  account  the  specific  distribution  of  (X  )_,„ 

n  K 

Based  on  central  limit  theorem  of  statistics,  this  approach  approximates 
the  binomial  distrubition  of  (X  )   with  a  normal  distribution. 


!(x  )K-x]2       29 

n  K 

P{(Xn»K)=7?r77-—  e  ^Vk  3.28 

1Vk 

The  confidence  level  a  with  maximum  allowable  error  e  is  given  by 

X+  E 

»-  'p((Xn)K>  ^\h  3.29 

"  X-  £ 

By  defining  a  new  variable 

Z  =    "K   ~  3.30 

and  substituting  Equation  3.28  into  3.29,  it  becomes 

w 

a  =  - -  e    dZ  3.31 

-w 

where  W  =  /-q  L  3.32 

UnJK 

In  terms   of  the  error   function  ERF,   Equation   3.31  becomes 

a  =  EBFinm -}  3.33 

y^  °  (y   ) 
UnjK 

substituting  Equations  3.20,  3.26  into  Equation  3.33,  one  can  write 

a  =  ERF(e/2K}  3.3^ 

This  gives  the  relation  between  a,  e,  and  K.   The  validity  of  the  approxima- 
tion being  made  in  this  approach  requires  a  large  K.   In  the  case  of  the  APiS 
machine  K  is  over  10  ,  Equation  3.3^  gives  a  good  approximation*  to  the 
relation  between  the  parameters  involved.   For  the  example  of  95%   confidence 

with  1%   maximum  allowable  error,  the  integration  must  be  taken  over  approxi- 

U  k 

mately  K  =  10  periods.   Comparing  with  the  result  K  =  5  x  10  obtained  from 


According  bo  the  diBCiission  given  in  Reference  (20),  the  error  produced  by 

'A: 


5  1 

proximation  for  the  case  in  which  K  =  10  and  Exp[(X^)v]  =  —  is  less 


II..-.    f\        -5  0/ 
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the  previous  approach,  they  differ  by  a  factor  of  5-   However,  they  are  still 
consistent  because  what  the  previous  approcah  concludes  is  that  for  K  =  5  x  10  , 
the  reading  provides  at  least  95%  confidence  and  accurates  to  within  1%.      The 
result  of  the  second  approach  obviously  agrees  with  it. 

3.7.2  Statistical  Properties  of  the  Result  of  Addition  and  Subtraction  by 
an  APE 

A  close  examination  of  the  operation  of  addition  reveals  that  inte- 
grating the  sum  over  K  clock  periods  corresponds  to  integrating  the  addend  and 
the  augend  over  -r   clock  periods  each.   Let  x,  y  be  the  addend  and  the  augend 
respectively.   The  sum  in  terms  of  machine  number  is  given  by  z  =  x  +  y  -  — 
as  shown  before.   According  to  Equations  3.19  and  3.20  the  sampling  statistics 
of  a  sample  size  of  H  =  t  from  X  and  Y  are  given  by 

H  X. 
V,Y  ,  =  E*p(  ^  -|}  =  x  3.35 

^VH       i=l 


o  Zx(l-X) 

(Xn}H~    H 


3.36 


H  Y. 
y(y  j  =  Exp{r  -|}  =  y  3#37 


'n'H      i=l 


°(Y  V  "    H  3'38 

n  H 


It  can  be  shown  that  the  sampling  mean  of  the  sum  is  given  by 

3.39 
By  substituting  Equations  3.35,  3.37  into  Equation  3.39,  one  can  write 


"<2A  ■  M<Xn>H  +  "<VH  "  ^ 


V..  =x  +  y--  3.U0 

Z  2 

n  K 

It  can  also  be  shown  that  the  sampling  standard  deviation  of  the  sum  is 
given  by 


f*(i-x)  ,  yd-y)  3M 


a(z  ) 

n;K  '   2K       2K 
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Substituting  Equation  3.^1  into  Tchebychef f s  inequality,  one  gets 

{i_o)<   X(l-X)  +  y(l-y)  3M 

2Ke 

Recall  that  x(l-x)<     r-9   y(l-y)    <  v*.    Consequently,   Equation   3.^2   can  be  writtei 

K>  1 


he    (l-a)  2.I43 

This  gives  a  relation  between  confidence  level  a ,  maximum  allowable  error  e 

and  the  sampling  size  K,  based  on  the  1st  approach.   The  result  based  on 

the  2nd  approach  can  also  be  obtained  by  replacing  the  <j  ,      •,      in  Equation  3.33 

n  K 
with  a   >  .   The  result  is  given  by 
UnjK 

a  =  ERFU/2K  }  3.UU 

The  subtraction  operation  has  the  same  statistical  properties  as  the  addi- 
tion operation,  because  subtraction  is  done  exactly  the  same  way  as  the 
addition  operation,  except  that  the  subtrahand  SRPS  is  inverted  before  it  is 
added  to  the  minuend.   This  only  amounts  to  changing  the  value  x  of  the  sub- 
trahand SRPS  X  to  (l-x)  before  the  addition  operation  takes  place, 
n 

3*7.3  Statistical  Properties  of  the  Product  of  Multiplication  Operation 

Let  z,  x  and  y  represent  the  product,  the  multiplicand  and  the 
multiplier.   As  mentioned  in  Section  3.5,  the  sequence  obtained  at  the  out- 
put of  the  EXCLUSIVE-OR  gate  is  an  SRPS  representing  the  product  with  the 
mean  value  given  by 

MZ  =  z 

=  x  +  y  -  2x-y  3.U5 

All  the  result  about  the  statistical  properties  of  an  SRPS  derived  in 
3.7.1  are  therefore  applicable  for  the  product.   In  particular,  the  fluctua- 
tion of  the  samples  is  given  by 
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/(x+y  -  2x-y)(l-x  -y  +  2x-y)  ,  > 

°(Z  )_  "  /  K  3'hb 

n  H 

The  relations  "between  a  ,  K,  and  e  for  the  product  is  identical  to  the  ones 

given  in  Equation  3. 27  and  Equation  3.31*. 

3.7.4  Statistical  Properties  of  the  Quotient  in  Division  Operations 

It  was  shown  in  Section  3.6  that  the  Quotient  Z  of  the  division 
operation  is  a  random  variable.   Its  statistical  properties  is  now  being 
investigated.   The  distribution  function  of  Z   corresponds  to  the  probabilities 
of  getting  a  fixed  number  of  pulses  for  different  numbers  of  clock  periods. 
Let  T.  be  the  number  of  clock  periods  between  the  i  *  pulse  and  its  follow- 
ing pulse  in  a  sample  function  of  an  SRPS.   The  total  number  of  clock 
periods  T  for  an  SRPS  D  ,  representing  the  denominator  in  Equation  3.l6 
in  the  case  of  division  operation,  to  produce  N  number  of  pulses  is  therefore 
given  by 

N 

T  =  E   T.     N  <  T  <  00  g  ._ 

n   .  .,   1        n  3.4  f 

i=l 

Theoretically,  T  ranges  from  N  and  infinity.   For  a  real  computer, 
integers  cannot  be  larger  than  a  limit  determined  by  the  particular  struc- 
ture of  the  computer.   For  the  APE  machine,  this  limit  is  K  =  2   .   Therefore 
Tn  actually  ranges  from  N  to  K.   The  normalized  value  of  T  ,  given  by 
t  =  -r-  ■>    corresponds  to  the  final  result  of  the  division  operation  in  the 
form  of  a  binary  fraction.   To  examine  the  statistical  properties  of  t  , 

it  is  better  to  start  from  the  distribution  function  of  T. .   Since  T  is 

1  1 

the  number  of  clock  periods  between  two  consecutive  pulses  in  the  SRPS  D 

n 

representing  the  number     d,   the  probability  that  T.    =  n   is   given  by  the 


probability  of  having  no  pulse  in  (n-l)  clock  periods  times  the  probability  of 
having  a  pulse  in  the  n   period.   One  can,  therefore,  write 

P(Ti=n)  =  d(l-d)n_1  3.1+8 

/    n  d 

Let  b  denote  (1-d)  and  D  be  the  differential  operator  ~  .   The  expectation 

value  of  T.  can  then  be  expressed  as 
1 

oo 

Exp[T.}  =  I      nd-b11*1 
1    n=l 


=  d  Z       D*bn 
n=l 


=  d'D  I      bn 

n=l 


=  d'D(r\-)  3.U9 


1-b' 


2 

The  mean  value  of  T.   is  given  by 

00 

Exp{Ti2}    =     Z      n2-d-bn-1 
n=l 

00  00 

=  d    Z     (n+l)n-bn_1   -  d     Z      n-bn_1      3.50 
n=l  n=l 

Upon   substituting  Equation  3.^9   into  the  2nd  term  of  Equation  3.50,   one  gets 

00 

Exp(T.2}    =   d     I      D2-bn+1   -  i 
n=l 

2  l 

=  -?-!  3-50 


3U 


The   standard  deviation  aT   is   given  by 


aT  =  Aar{T.} 


=   /Exp{T,2}   -  Exp2{T.}  3.50a 

By  substituting  Equations   3.^9   and  3-50   into  Equation   3.50a,    one  gets 


CTrji 


-J2-. 

d2 

1  _ 
d 

1 

'd2 

=  A-'. 

d2 

1 

d 

3.51 


The  mean  and   standard  deviation  of  t      can  now  be  obtained  as   follows 

CO  OO  CO  1» 

Exp{tw}=77       E  E      ...      E        I      T    'P(T        T,    ...,    T    )        3.52 

N  K  T1=l  T2=l  TK=1   i=l      X  1        2  N 

Since  T.    and  T.5    i  4   j9   are   independent   random  variables 
-1-  J 

P(TX,   T2,    ...,   TN)   =  P(T1)P(T2)    ...    P(TN)  3.53 

Together  with  the  fact   that 


I     P(T. )   =  1        for  all  i  3.5^ 

T.=l  X 

1 


Equation  3.52  becomes 


N 
Exp{tN}   =  |     E      Exp{T.}   =  N-|  3.55 

i=l 


Wow  the  variance  of  t   can  be  expressed  in 


Var{tN}  =  Exp[(tN  -  Expf^} )  2J  3.56 
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Combining  Equations  3.55  and  3.56  one  obtains 

N       N 
Var{tN)  =  Expt^  (  I      T   -  E   Exp{T.}) 
K   i=l  1  i=l 


2- 


1        N  . 

=  ^Exp{[  E   (T.  -  Exp(T.};l   > 
K       i=l    1        X 

N   N 
=  ~  Exp  E   E   [T.-Exp(T.}  ][T  .  -Exp(T.>  ]   3.57 
K      i=l  j=l   X  x     J       J 

Because  of  the  fact  that  T.  and  T.,  for  i  ^j  ,  are  statistical  independent  and 

CO 

that.  Z_  P(T.)  =  1,  the  expectation  operator  and  the  summation  operators  in 

Equation  3.57  can  be  shown  to  be  commutative,  i.e., 

N   N 
Var{t  }  =  i-  E   E   Exp{ [T.-Exp(T. ) ] [T .-Exp(T . ) ] }   3.58 
K  i=l  j=l       i      i    J      J 

Again  because  T.  and  T.,  for  i  ^  j,  are  independent,  all  terms  under  the 
summation  sign  with  i  i   j  in  Equation  3.58  are  zero.   There  Equation 

3.58  becomes 

1   N  2 

Varft  }  =  i-  E   Exp{(T  -Exp(T.r)} 

K  1*1 

1   N 
=  is-  E  Var{T  }  3.59 

K  i=l 
Because  all  the  T.'s  have  identical  distribution,  they  therefore  have  identical 
variance.   Consequently  Equation  3.59  becomes 

Var(tN}  =  ^7  Var{T.}  3.60 

it  follows  that  the  standard  deviation  is  given  by 


_  K 

\  K 


3.61 
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Combining  Equations  3.51  and  3.6l,  one  gets 


N 


JWWAd2     d 


3.62 


Recall  that  the  division  operation  is  performed  by  counting  the  pulses  of 

an  SRPS  D  representing  the  divisor  I  1-2x1  ,  shown  in  Equation  3.l6,  over  a 

n 

period  such  that  the  final  reading  of  the  counter  in  binary  fraction  equals 
to  the  dividend  I x-y I  .   The  term  (— )  in  Equation  3.62  is  the  normalized  value 


'K 


N 


of  the  number  of  pulses  being  counted.   Therefore  the  term  —  actually 

corresponds  the  dividend  | x-y|  in  the  division  operation.   The  parameter 

d  is  the  divisor  in  machine  number.   It  has  the  value  of  |  l-2x|  .   After 

replacing  d,  and  —  with  the  expression  in  terms  of  x  and  y,  Equation  3.62 
K 

becomes 


at  = 


x- 


1 


1 


1.63 


i  !  I  ■-: '  -,r     I  l-2x| 

The  relation  between  a,  e  and  K  according  to  the  Tchebycheff ' s  inequality 
can  be  obtained  by  combining  Equations  3.63  and  3.27.   The  result  is  given 
by 

lx-yl        /  1 


K 


3.6U 


(l-a)e       M  l--2.x.|  '"         !  1-  2r| 
Based  on  the   central  limit  theorem,  the  relation  between  a,   £    and  K  can  be 
approximated  by 


a  =  ERF 


033 


K 


x-y  I 


x-y  I 


3.65 


3.7. 5  Statistical  Properties  of  the  Result  of  Cascading  Two  Operations 

The  statistical  properties  of  the  basic  arithmetical  operations  have 
been  examined.   They  are  investigated  under  the  condition  in  which  the  inputs 
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to  the  stochastic  processor  are  deterministic.   Now  the  problem  of  probabilistic 
inputs  is  to  be  investigated.   This  is  the  case  when  two  levels  of  stochastic 
processors  are  operated  in  cascade,  like  the  example  shown  in  Figure  2.2.   The 
stochastic  processors  in  the  second  level  are  driven  by  the  outputs  of  the 
first  level  ones.   These  APE  outputs  are  the  sampling  means,  instead  of  the 
means,  of  the  output   SRPSs  of  the  first  level  APEs  and  are  therefore  random 
variables.   Let  the  outputs  of  two  first  level  APEs  be  denoted  by  X  and  Y. 
They  are  random  variables  with  normal  distribution,  according  to  central  limit 
theorem,  as  discussed  above.   When  X  and  Y  are  fed  into  another  APE  for  further 
processing,  the  result  can  be  represented  in  general  by 

Z  =  f(X,  Y)  3.66 

Theoretically,  the  statistical  properties  of  Z  can  be  derived  from  those  of 
X  and  Y.   Unfortunately,  even  for  some  rather  simple  function  f(X,  Y),  to  obtain 
the  result  in  explicit  closed  form  becomes  very  much  involved.   A  direct 
approach  to  this  problem  is  to  find  out  the  distribution  of  Z  from  those  of 
X  and  Y  and  from  f(X,  Y).   However,  the  following  approach  dodges  the  evalua- 
tion of  the  distribution  of  Z  and  simplifies  the  problem  somewhat.   To  com- 
pute the  expectation  value  of  Z,  one  writes 

I    1 

Z  =j     f(X,  Y)P(X,  Y)  dX-dY  3.67 

0  0 

where  P(X,  Y)  is  the  joint  distribution  function  of  the  input  variables.   For 

the  standard  deviation  a?  one  can  first  find  out  Z  ,  the  simplified  notation 

p 
for  Exp[Z  ] ,  by 

1  1 

Z2  =    f2(X,  Y)P(X,  Y)dX*dY  3.68 

'  0  0  — 

2    2   2 

and  makes  use  of  the  expression  a_  =  Z  -  Z  3.69 

Li 
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The  standard  deviation  a  is  then  given  by 


11,  11  o 

az  =  /{/7f  (X,Y)P(X,Y)dX'dY}  -  j/yAf  (X,Y)P(X,Y)dX-dY}  *  . 

0  0  Ot)  3.70 

This  equation  indicates  the  fluctuation  of  the  result  due  to  that  of  the 
inputs.   For  example,  if  the  operation  is  multiplication  with  the  multipli- 
cand and  the  multiplier  given  by  the  sampling  means  of  two  SKPSs  and  with 
the  product  in  the  form  of  another  SRPS ,  the  mean  value  of  the  product  SRPS , 
instead  of  being  a  deterministic  number  representing  the  exact  value  of  the 
product,  is  a  random  variable  and  fluctuates  with  a  standard  deviation  given 
by  Equation  3-70.   The  relation  between  the  confidence  level,  maximum 
allowable  error,  and  standard  deviation  as  given  in  3.33  is  apparently 
applicable  here.   Let  the  confidence  level,  error  and  standard  deviation 
associated  with  the  fluctuation  of  the  mean  value  of  the  output  random  vari- 
able due  to  the  input  fluctuation  be  a  ,  e   and  a     respectively.   Then  the 

Zi    A  A 

relation  between  them  is  given  by 

a  =  ERF(-— ; }  3.71 


VT 


Gz 


Let  the  confidence  level,  error  and  standard  deviation  associated 

with  the  estimation,  with  a  finite  sample  size  of  K  samples,  of  the  output 

variable  having  deterministic  mean  value  be  a  ,  e   and  a    .   Their  relation 

K   K      K 

has  been  derived  before  and  is  given  by 

a  =  ERF{ — }  3.72 

K       /2  a 

K 

Now,  let  the  confidence  level  and  error  due  to  the  combination  of  the  two 
cases  mentioned  before  be  n^,    c  -.   It  is  not  difficult  to  see  that 


T         Z  K 

Combining  Equation  3-71,    3. 72   and  3.73  one   gets 
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£       £       £ 

+  v  3.73 


a   =  ERF{-— -^-}{ERF  — ^-}  3.7^ 

•  2  a       /2  a 

Z  K 

As  an  example,  let  us  consider  an  addition  operation  with  the  addend  and  augend 

coming  from  the  outputs  of  other  APEs.   According  to  the  previous  notation, 

the  addend  and  augend  are  denoted  "by  (X  )   and  (Y  )   to  indicate  that  they  are 

n  K       n  K 

obtained  from  SRPSs  X  and  Y  by  averaging  over  K  clock  periods.   In  this 

n      n 

section,  the  subscripts  are  dropped  to  simplify  the  notation  so  that  (X  )   is 

n  K 

now  denoted  by  X  and  (Y  )„,  by  Y.   The  addition  operation  is  therefore  given 

n  is. 


by 


f(X,Y)  =Z=X+Y-|  3.75 


To  find  a   according  to  Equation  3.70,  one  must  first  find  out  P(X,Y).   Since 

Li 

X  and  Y  are  independent  random  variables,  one  can  write 

?(X,Y)  =  P(X)-P(Y)  3.76 

By  substituting  Equation  3.28  into  3.76,  one  gets 

_[(x-yx)2  +  (Y-My)2 

P(X,Y)  =  0  X   e  L  2nX  2aY  J  3.77 


2tto 


X°Y 


where  y   and  y   are  the  mean  values  of  X  and  Y  respectively  and  av  and  a 

AX  AX 

are  the  standard  deviations  of  X  and  Y  respectively.   Upon  substituting 
equations  3.75  and  3.77  into  Equation  3.67,  one  gets 


Uo 


C/    .-   )*  (Y-yy)2 


V 


2  a, 


2  a 


1  1 

r  r 


z  -J  J 

o  o 


(X  +  Y  -  i)' 


2    27Taxax 


dXdY 


y     +  y 

X  Y 


Similarly,  one  can  compute  the  expectation  value  of  Z  ,  by 


(X-^)2    (Y-uy)2 


z2  = 


1  1 


2  a 


X 


2o 


Y 


lv2. 


0  0 


(X  +  Y  -  |)  "2rra^JY 


dXdY 


a^2  +  y  2  +  av2  +  yv2  +  \  +  2yyyY  -  yy  -  y. 


JX 


'X     Y 


k        "MXMY    MX   HY 


The  standard  deviation  a   is  therefore 

LA 


3.78 


3.79 


a  =  /  2   -2 


3.79A 


Upon  substituting  Equations  3.78  and  3.79  into  Equation  3.79A  one  finds  that 
many  terms  in  the  righchand  side  of  Equation  3.79A  cancel  out,  so  that 

3.80 


/  2     2 

=  vo        +   a 


Z     X     Y 
Combining  Equations  3.80  into  3.7^+  one  arrives  at 


aT  =   ERF 


srrr^y 


ERF 


K 


^c 


3.81 


'X    UY        ""  ~K 
Recall  that  cu  represents  the  fluctuation  of  the  estimation  of  the  mean  value 
x  of  the  random  variable  X  by  averaging  over  K  samples.   According  to 
Equation  3.20 


Ill 


/x(l  -  X 
/    K 


ax  ,- 


However,  x(l  -  x)  is  always  less  than  r\   Therefore,  one  can  write 


2   1 


av  <  —  3.83 


Similarly 

aY2<  fe  3'8U 

a  2  <  1  o 

K  '  UK  3'85 

Upon  substituting  these  expressions  into  Equation  3.8l,  it  becomes, 

a   >  ERF[e_/K]  -ERF[e  /2K]  3.86 

1         Zi  K 

As  an  example,  let  K  =  13  x  10  ,  c      =   0.5%,    e      =   0.5%   so  that  e   =  1%, 

L  K  1 

one   gets 

ctT  >   ERF{1.80}ERF{2.55>  3.87 

>    0.988 

U 
In  other  words,  if  integration  is  taken  over  K  =  13  x  10  number  of  clock 

periods,  the  output  reading  of  this  cascaded  operation  would  be  within  1%   of 

its  correct  result  with  98.8%  confidence. 

3.7-6    Statistical  Properties  of  the  Results  of  Differential  and  Integral 
Operations 

The  differential  operation  is  actually  a  subtraction  operation. 

Therefore  all  the  statistical  properties  derived  for  subtraction  operation 

are  applicable.   The  statistical  properties  of  the  result  of  the  integral 

operation  are  similar  to  those  of  the  addition  operation  with  nondeterministic 

inputs.   Therefore,  the  results  derived  in  Section  3.7.5  must  be  applied  to 

compute  the  relation  between  confidence  level,  accuracy  and  sampling  time. 
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4.   THE  GENERATION  OF  THE  SYNCHRONOUS  RANDOM  PULSE  SEQUENCES  ; 

l+.l      General  Requirements  on  the  SRPSs  Used  in  the  APEs 

There  are  four  basic  requirements  on  the  SRPSs  used  in  the  APEs: 
a)   It  has  been  shown  in  the  previous  chapters  that  numbers  represented  by 
SRPSs  can  be  operated  upon  with  great  ease.   However,  the  significance  of 
this  advantage  depends  on  the  ease  with  which  a  number  can  be  converted  to 
and  from  an  SRPS.   Therefore,  the  conversion  must  be  able  to  be  accomplished 
with  simple  circuits,   b)   Most  of  the  operations  of  the  APEs  involve  two 
operands.   It  is  therefore  required  to  have  two  SRPSs  representing  two  dif- 
ferent operands.   These  two  SRPSs  must  be  statistically  independent  as 
required  in  the  multiplication  operation.   c)   The  APEs  obtain  the  power 
remotely.   The  maximum  power  available  to  an  APE  is  less  than  100  mW.   There- 
fore, the  converter  must  consume  very  little  power.   It  is  this  fact  that 
precludes  the  well-tested  method  of  converting  a  number  into  SRPS  by  means 
of  noise  diodes  and  thresholding  technique,   d)   Because  the  number  to  be 
converted  might  be  a  function  of  time,  the  resulting  SRPS  must  be  able  to  fol- 
low this  change  and  reach  a  new  steady  state  within  a  small  fraction  of  a 
computing  cycle. 


4.2 


Conversion  of  a  Number  into  a  SRPS 


The  basic  idea  of  converting  a  number  into  its  SRPS  representation 
is  to  compare  the  number  with  the  output  of  a  noise  source.   There  are  many 
approachs  to  the  conversion  based  on  different  ways  to  make  comparisons  and 
different  principles  to  generate  noise^12^13^   ^15').   The  one  most  suitable 
for  the  APE  machine  is  shown  in  Figure  4.1,  The  number  to  be  converted  is 
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in  binary  representation  and  denoted  by  X  .   This  number  is  stored  in  a 
binary  register.   The  length  of  the  register  is  10  bits  for  the  APE  machine. 
This  10-bit  binary  number  is  compared  with  the  output  of  a  10-bit  random 
number  generator  continuously.   The  random  number  generator  produces  a  new 
10-bit  binary  number  at  the  beginning  of  every  clock  period.   The  10-bit 
random  binary  number  has  a  uniform  distribution  function.   The  comparator 
output  is  '1'  if  the  10-bit  binary  random  number  is  smaller  or  equal  to 
the  10-bit  binary  number  X  .   It  is  a  '0'  otherwise.   Because  the  random 


number  X  is  uniformly  distributed,  the  output  of  the  comparator  is  a  SRPS 

r 

with  the  probability  of  occurrence  of  a  '  1'  in  each  clock  period  equal  to  X 


m 


I+.3      Generation  of  a  10-bit  Pseudorandom  Binary  Number  with  Uniform 
Distribution 

It  is  rather  involved  to  generate  a  truly  random  number  with  pre- 
determined statistical  properties.   However,  certain  types  of  well-defined 
periodic  signals  could  appear  to  be  random  if  the  periods  of  the  signals  are 
much  longer  than  the  observation  period.   One  example  of  this  type  of 
signal  is  the  pseudorandom  binary  sequence.   It  can  be  generated  with  an 
extremely  long  period  and  with  very  simple  hardware.   The  numbers  are  gen- 
erated by  modulo-2  linear  recurrence  technique  long  used  to  generate 
binary  code  for  communications.   In  this  method,  a  shift  register  of 
n  stages  is  connected  with  a  feedback  path  as  shown  in  Figure  k.2.      If  the 
function  f^,  x^,    ...,  x  )  can  be  expressed  as  a  linear  combination  of  the 
output  of  various  stages  of  the  register,  this  type  of  circuit  is  referred 
to  as  linear  feedback  register.   If  the  register  is  initially  in  a  state  where 

Xl  :=  ai  »  XP  =  ap'  '••»  xr.  "  av, »  the  nex"t  state  of  register  is  completely 
determined  and  can  be  expressed  by 


n  -  Stage   Shift  Register 
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Figure  k.2       A  General  Linear  Feedback  Shift  Register  Arrangement 
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Figure  U.3   The  Arrangement  for  the  Generation  of  a  20-Bit 
Maximum  Length  Sequence 
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Xl  =  C1X1  +  °2X2  +  *  *  *  +  CnXn 


x.  =  x.-l      i  =  2,  3,  ...  ,  n 

1    i 


k.l 


,  th 

where  x  denotes  the  current  value  of  the  i   stage,  x.  denotes  the  value 
i  x 

of  the  i   stage  during  the  last  clock  period.   This  means  that  a  new  n-bit 
binary  number,  related  to  the  last  one  by  Equation  k.l,    is  generated  by 
the  register  every  clock  period. 

It  is  obvious  that  a  different  set  of  coefficient  c.  corresponds 
to  a  different  sequence  of  n-bit  binary  number  produced  by  the  register. 
With  an  appropriate  set  of  coefficients,  the  length  of  the  sequence  before 
it  repeats  itself  can  be  made  to  be  2  -  1,  which  is  only  one  less  than 
the  maximum  number  of  states  a  n-stage  register  can  possibly  be  in.   For  a 
20  stages  feedback  shift  register,  this  corresponds  to  a  sequence  of  more 
than  one  million  binary  numbers  before  any  periodicity  occurs.   It  can  be 
shown  that  this  maximal-length  sequence  with  a  period  of  2  -  1  can  be 

obtained  if  the  set  of  coefficient  c.'s  equal  to  the  coefficients  of  a  pri- 

th  (16) 

mitive  n   degree  polynomial  '    .   For  a  20  stages  feedback  register,  the 

set  of  coefficient  has  been  worked  out  in  reference  (IT).   The  result  is  that 

all  coefficients  equal  to  zero  except  c  „  =  con   =  1,  as  depicted  in 

Figure  U.3.   The  first  ten  bits  of  the  output  of  the  register  are  used  to  form 

10-bit  binary  number  while  the  remaining  ten  bits  are  used  to  form  another 

10-bit  binary  number.   If  these  two  binary  numbers  are  taken  once  after 

every  shifting  of  the  register  to  form  two  binary  number  sequences,  they  can 

be  shown,  by  following  a  similar  argument  as  presented  in  reference  (lU),  to 

be  two  statistically  independent  pseudo-random  binary  number  sequences  with 

uniform  distribution.   They  are  therefore  being  used  as  the  noise  sources 

for  the  APE  machine. 


hi 

k.h  The  Singular  State  of  a  Linear  Feedback  Shift  Register 

A  n-stage  feedback  shift  register  has  2  different  states.   A  maximal 
length  sequence  will  go  through  every  state  except  the  all-zero  state.   A 
close  look  at  Figure  U.3  reveals  that  once  a  linear  feedback  shift  register 
is  in  this  all-zero  state  it  remains  there  indefinitely.   If  this  is  the  case, 
the  feedback  shift  register  ceases  to  generate  pseudorandom  numbers.   There- 
fore, protective  circuitry  must  be  used  to  provide  continuous  monitoring  of 
the  feedback  shift  register.   If  it  ever  gets  into  this  all-zero  singular  state 
for  some  reason,  the  protective  circuit  must  be  able  to  pull  it  out.   A  simple 
circuit  to  do  this  is  outlined  in  Figure  h.k.      It  employs  a  5-stage  counter 
driven  by  the  same  clock  that  drives  the  shift  register.   The  reset  of  the 
counter  is  connected  to  the  output  of  any  stage  while  the  carry  output  of 
the  counter  is  added,  modulo-2,  to  the  output  of  an  arbitrary  stage.   The  sum 
is  then  fed  into  the  subsequent  stage.   If  the  X.'s  are  not  all  zero,  the 
feedback  shift  register  will  not  be  in  the  all-zero  state  under  proper  operat- 
ing conditions.   In  this  case,  the  reset  line  will  be  in  logical  ' 1'  level 
at  least  once  every  twenty  clock  periods.   As  a  consequence,  the  5-bit  counter 
will  be  reset  before  the  carry  bit  becomes  '1'.   Hence,  the  carry  output  is 
constantly  '0'.   The  output  of  the  modulo-2  adder  is  therefore  identical  to 
the  output  of  the  stage  denoted  by  X  .   In  other  words,  the  protective  circuit 
will  have  no  effect  on  the  operation  in  this  case.   However,  if  the  feedback 
shift  register  slips  into  the  all-zero  state,  the  carry-out  will  be  at  logical 
'1'  level  after  31  shifts  because  of  the  lack  of  the  reset  signal.   A  logical 
'1'  signal  in  the  carry-out  line  when  the  feedback  shift  register  is  in  the 
all-zero  state  results  in  inserting  a  '1'  into  X  +  1  stage  during  the  next 
clock  period.   Therefore,  the  feedback  shift  register  is  again  put  back  into 
a  non-singular  state. 
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1+.5      The  Binary-Number-to-SRPS-Converter  of  the  APE 

The  complete  binary-number-to-SRPS-converter  of  the  APE  is  illustrated 
in  Figure  h.5.      The  entire  converter  employs  a  20-stage  shift  register,  a 
5-stage  counter,  two  10-bit  comparators,  and  two  modulo-2  adders,  better 
known  as  EXCLUSIVE-OR  gates.   The  two  10-bit  registers  for  storing  the 
binary  numbers  are  actually  part  of  the  input  decoding  circuits.   Comple- 
mentary Symmetry  Metal  Oxide  Integrated  circuits  are  used  to  implement 
this  converter  because  of  their  extremely  low  power  comsumption.   The  overall 
power  consumption  of  the  converter  is  less  than  2  mW  when  it  is  operated 
at  a  150  kHz  clock  rate.   This  power  consumption  level  is  much  lower  than 
any  other  approach  could  possibly  attain.   The  dynamic  performance  of  this 
converter  is  also  very  good.   Whenever  there  is  a  change  in  the  number  to 
be  converted  into  SRPS,  the  mean  value  of  the  output  SRPS  following  the 
change  and  reach  steady  state  in  one  clock  period. 
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5.   CIRCUIT  DESCRIPTION  OF  THE  APEs 

5.1  Special  Design  Considerations  for  the  APEs 

Because  of  the  limited  power  available  for  the  APE, the  overriding 
factor  in  its  design  is  low  power  consumption.   An  original  estimate  of  the 
capability  of  remote  powering  of  diverse  systems  puts  the  ceiling  on  the 
power  consumption  of  an  APE  at  100  mW.   This  severe  limitation  on  power  con- 
sumption "brings  about  many  challenges  in  the  implementation  of  the  APEs  and 
affects  every  facet  of  the  circuit  design.   It  immediately  rules  out  the 
possibility  of  taking  advantage  of  many  commerically  available  communica- 
tion circuits.   In  fact,  all  communication  circuits  used  in  the  APEs  were 
developed  in  the  Digital  Computer  Laboratory  of  the  University  of  Illinois. 
For  logic  circuits  C0S/M0S  integrated  circuits  are  used  exclusively  because 
of  their  extremely  low  power  consumption.   In  the  design  of  the  communication 
subsystem  of  the  APE,  multiplexing  techniques  are  employed  whenever  possible 
to  reduce  the  number  of  receivers  and  transmitrers  required. 

5.2  Block  Diagram  of  the  APEs 

A  functional  diagram  on  an  APE  has  been  shown  in  Figure  2.3.   It  is 
presented  in  a  way  suitable  to  explain  how  the  APEs  function.   However,  in 
the  implementation  of  the  APE,  some  of  the  functional  blocks  are  merged 
together  while  others  are  realized  with  separate  physical  hardware.   From  the 
hardware  implementation  point  of  view,  the  block  diagram  of  an  APE  can  be 
depicted  somewhat  differently  as  shown  in  Figure  5.1.   There  are  two  tunable 
receivers  for  data  inputs.  However  one  of  ihese  tunable  receivers,  namely 
receiver  B  in  the  figure,  is  operated  on  a  time-sharing  basis  for  both  input 
data  reception  as  well  as  program  instruction  reception.   Furthermore,  this 
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receiver  is  operated  at  different  priority  levels  with  the  reception  of  the 
program  instruction  having  higher  priority  over  the  reception  of  input  data. 
Whenever  a  program  instruction  is  to  he  sent  out  from  the  program  control 
unit,  the  latter  is  first  switched  to  the  programming  mode.   This  causes 
the  clock  transmitter  to  stop  the  transmission  of  the  clock  signals.   The 
ahsence  of  the  clock  signals,  in  turn,  shuts  off  all  output  transmitters  of 
the  APEs .   As  a  result  the  frequency  channel  for  the  program  instruction  trans- 
mission is  clear  from  RF  interference.   The  control  of  the  tunable  receiver  B 
is  then  taken  over  for  the  transmission  of  program  instructions  and  the 
receiver  automatically  tunes  to  the  fixed  frequency  channel  assigned  to  that 
particular  type  of  APE  for  program  instruction  transmission,  as  denoted  by 
v...  in  Figure  5.1.   The  same  channel  frequency  \>      is  later  used  to  transmit 
output  data  when  the  APE  machine  is  in  the  execution  mode.   Because  the  APE 
transmitter  is  already  shut  off  and  no  data  output  is  sent  during  the  program- 
ming mode,  the  channel  is  used  exclusively  for  the  transmission  of  the  program 
instruction.   The  output  signal  of  the  time-shared  receiver  B  is  used  to  feed 
decoder  for  function  during  the  programming  mode  and  to  feed  the  data  decoder 
during  the  execution  mode.   The  program  instructions  are  sent,  one  at  a  time, 
to  all  types  of  APEs  involved  in  a  specific  program.   Upon  the  completion 
of  the  programming  process,  the  program  control  unit  is  switched  to  the 
execution  mode.   Note  that  a  program  instruction  contains  the  information  of 
what  channels  the  two  tunable  receivers  are  to  be  tuned  to  as  well  as  what 
specific  operation  the  stochastic  processor  is  to  perform!   In  the  execution 
mode,  the  control  of  the  tunable  receiver  B  is  returned  to  the  APE  for  data 
input  reception.   Like  the  tunable  receiver  A,  tunable  receiver  B  is  now  tuned 


according  to  the  program  instruction.   A  multiplexing  technique  is  also 
employed  to  operate  the  transmitter,  which  is  used  to  transmit  the  output 
data  and  the  reply  to  the  test  signal  from  the  program  control  as  veil. 
During  the  execution  mode,  the  transmitter  is  used  for  output  data  trans- 
mission.  It  is  otherwise  used  for  replying  to  the  test  signal  during  the 
test  mode. 

The  multiplexer  control  signal  is  sent  to  the  APE  from  the  clock 
channel.   A  continuous  presence  of  an  RF  signal  in  the  clock  channel  tells 
the  APE  to  operate  in  the  testing  mode.   For  programming,  as  mentioned  before, 
no  RF  signal  is  sent  through  the  clock  channel.   In  the  execution  mode,  a 
sequence  of  RF  pulses  carrying  the  timing  information  is  sent  through  the  clock 
channel. 
5.3      The  Timing  Circuit  of  the  APEs 

All  operations  of  the  APE  are  performed  in  synchronism  with  a  com- 
mon time  reference.   A  composite  clock  carrying  the  clock  signal  and  the 
synchronizing  signal  is  sent  from  the  APE  control  unit  through  the  clock 
channel  at  U2.5  MHz.   The  clock  signal  has  a  frequency  of  165.^50  KHz 
and  the  synchronization  signal  uses  1  Hz.   The  APEs  begin  to  communicate  with 
each  other  at  the  occurrence  of  the  synchronization  signal.   The  communica- 
tion period  lasts  —  of  a  second.   Immediately  afterwards,  the  computation 

Q 

begins  and  lasts  for  —  of  a  second.   Then  comes  the  next  synchronizing  signal 
and  a  new  computing  cycle  begins.   The  waveform  of  a  composite  clock  signal 
is  illustrated  in  Figure  5.2.   The  clock  signal  is  carried  by  the  regular 
pulse  while  the  synchronizing  signal  is  carried  by  two  consecutive  wider  pulses, 
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Three  timing  signals  are  required  by  the  APE.   They  are  the  synchron- 
zing  signal,  delayed  reset  signal  and  the  preset  signal.  The  synchronizing 
signal,  also  called  reset  signal,  appears  at  the  beginning  of  a  computing 
cycle  whereas  the  delayed  reset  appears  one  ninth  of  a  second  later  and  marks 
the  end  of  data  transmission  and  the  beginning  of  computation.   The  preset 
signal  occurs  at  about  3  msec  before  the  next  synchronizing  signal  does,  and 
is  used  to  set  up  the  transmitter  just  before  it  is  used  to  transmit  the 
output  data.   The  detail  discussion  on  the  generation  of  these  timing  signals 
is  given  below. 
5.3.1    Digital  Synchronizing  Signal  Separator 

This  part  of  the  timing  circuit  separates  the  synchronizing  signal 
from  the  composite  clock.   This  is  done  by  triggering  a  one-shot  circuit  with 
the  composite  clock.   The  output  of  the  one-shot  is  a  sequence  of  pulses  with 
a  fixed  width  which  is  wider  than  that  of  the  clock  pulse  but  narrower  than 
that  of  the  synchronizing  pulse,  as  illustrated  in  Figure  5.3.   The  composite 
clock  pulse  sequence  and  the  output  pulse  sequence  of  the  one-shot  are  com- 
pared by  feeding  them  into  the  data  input  and  the  clock  input  of  a  D-type 
flipflop  respectively.   The  output  of  the  D-type  flipflop  for  the  next  clock 
period  is  equal  to  the  value  of  the  D  input  at  the  time  the  clock  input  C 
changes  from  a  '0'  to  a  '1'.   By  comparing  the  waveform  of  the  composite 
clock  and  the  output  of  the  one-shot  as  shown  in  Figure  5.3,  it  is  easy 
to  see  that  the  Q  output  of  the  flipflop  is  a  '1'  when  the  synchronizing 
signal  occurs  and  is  a  *0'  otherwise.   Therefore  the  Q  output  of  the  flipflop 
contains  only  the  synchronizing  signal  and  is  used  for  resetting  the  input 
storage  at  the  beginning  of  every  computing  cycle. 
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Figure  5.3   Synchronizing  Signal  Separator  and  its  Waveform 
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5.3.2    The  Delayed  Reset  Signal  and  the  Preset  Signal  Generator 

A  delayed  reset  signal  is  required  to  signal  the  end  of  data  trans- 
mission and  the  "beginning  of  data  processing.   The  delayed  reset  occurs 
16,381+  clock  periods  after  the  reset,  or  approximately  —  of  a  second  after 
the  "beginning  of  a  computing  cycle.   The  preset  signal  is  used  to  turn  on 
the  output  transmitter  about  3  msec  "before  the  "beginning  of  data  transmission, 
The  circuit  used  to  generate  these  signals  is  shown  in  Figure  5.^.   It 
employs  an  l8-"bit  counter  with  the  reset  input  driven  by  the  synchronizing 
signal.   At  the  beginning  of  a  computing  cycle,  the  l8-bit  counter  is  reset 
to  the  all-zero  state,  and  the  delay  reset  output  is  also  set  to  '0'  by 
the  synchronizing  signal.   Then,  the  counter  starts  to  count  up.   Once.it 
reaches  l6,38H  counts,  the  output  of  the  15th  bit  changes  from  '0'  to  '1', 
causing  the  delay  reset  to  change  from  '0'  to  '1'  and  to  stay  there  afterwards, 
(due  to  the  latch  action  of  the  S-R  flipflop),  until  the  next  reset  signal 
comes  along.   The  preset  signal  is  set  to  '0'  at  the  beginning  of  a  computing 
cycle.   It  changes  to  *1'  when  the  counter  counts  up  to  lU6,9Uli,  i.e.  511 
clock  periods  or  approximately  3  msec  before  the  next  reset  pulse  appears. 
5.4      Mode  Control  Signal  Detector 

As  mentioned  earlier,  the  APE  can  be  operated  in  three  different 
modes,  namely,  the  programming  mode,  the  execution  mode,  and  the  test  mode. 
During  the  programming  mode,  the  output  transmitter  of  every  APE  is  shut  off 
and  a  data  input  receiver  is  converted  into  the  instruction  receiver.   For 
the  execution  mode,  the  output  transmitter  is  active  and  the  instruction 

eiver  is  converted  back  to  an  input  data  receiver.   The  signal  to  control 
the  mode  of  operation  is  being  transmitted  to  the  APEs  through  the  clock 
lannel.   For  the  execution  mode,  a  composite  clock  is  transmitted.   In  the 
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case  of  the  test  mode,  a  continuous  unmodulated  RF  signal  is  transmitted.     - 
For  the  programming  mode,  the  clock  transmitter  is  turned  off  completely. 

The  circuit  for  detecting  these  mode  control  signal  is  shown  in 
Figure  5.5.   The  circuit  employs  two  RC  timing  circuits  controlled  by  two 
diodes  separately.   Point  A  can  be  charged  to  a  logical  '1'  level  much  more 
easily  than  discharged  back  to  a  logical  '0'  level:   As  a  result,  point  A 
is  at  a  logical  level  '0'  only  during  the  programming  mode.   The  output  P  is 
therefore  at  a  logical  '1'  level  only  during  this  mode.   By  applying  the  similar 
argument,  point  T  is  visibly  at  a  logical  '1'  level  only  for  the  test  mode. 
The  execution  mode  is  identified  by  the  logical  '0'  level  in  both  T  and  P. 
5. 5      The  Function-Decoder  and  Channel  Multiplexer 

The  function  of  this  decoder  is  to  decode  and  store  the  instruction 
from  the  control  unit.   The  instruction  is  coded  in  a  binary  pulse-width- 
modulated  sequence  28  bits  long,  the  first  six  bits  carrying  the  operation 
code  6.   The  remaining  22  bits  carry  two  tuning  information  sequences  with 
11  bits  each  used  for  the  two  tunable  input  data  receivers.   The  decoding  of 
this  instruction  is  done  with  a  network  shown  in  Figure  5.6.   When  the  APE 
machine  is  in  the  programming  mode,  the  inverted  P  signal  from  the  mode  con- 
trol signal  detector  is  at  a  logical  '0'  level.   This  causes  the  signal 
at  point  A  to  be  gated  into  point  B,  as  well  as  setting  the  analog  switch 
to  the  programming  mode  position.   The  resistor  R  in  front  of  the  C0S/M0S  NOR 
gate  is  used  to  perform  noise  discrimination.   By  adjusting  the  value  of  R, 
one  could  set  the  threshold  on  the  input  signal  pulse  width  such  that  any 
narrower  pulse  will  be  rejected.   (Normally  noise  induced  by  transient 
phenomena  is  in  the  form  of  narrow  spikes!)   After  getting  through  the  NOR 
gate,  the  instruction  signal  is  decoded  by  comparing  the  width  of  each  puis? 
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with  that  of  a  standard  pulse  in  a  sequence  from  the  one-shot  circuit.   The 
comparison  is  accomplished  "by  feeding  the  pulses  into  the  data-  and  clock-inputs 
of  a  static  shift  register  as  shown  in  the  diagram.   When  a  narrow  pulse 
in  the  instruction  signal  comes  along,  the  data  input  of  the  register  will  be 
at  a  logical  '0'  level  by  the  time  the  clock  input  changes  from  a  logical 
'  0'  level  to  a  logical  '1'  level.   The  register  then  receives  a  logical  '0' 
input.   When  a  wide  pulse  in  the  instruction  signal  comes  along,  a  '1'  is 
received.   In  this  way,  a  sequence  of  pulse -width-modulated  pulses  is  decoded 
and  stored  in  the  form  of  a  sequence  of  binary  numbers.   The  parts  of  the 
instruction  carrying  the  tuning  information  are  then  sent  to  two  non-linear 
D/A  converters.   This  results  in  two  analog  tuning  voltages  custom-matched 
to  the  tuning  characteristics  of  the  individual  tuning  diode.   An  analog 
gate  is  employed  to  switch  one  of  these  analog  tuning  voltages  to  tune 
receiver  B  in  the  execution  mode  and  to  switch  a  fixed  voltage  for  tuning  the 
receiver  B  for  instruction  reception  in  the  programming  mode. 
5.6      Input  Duty-Cycle  Decoder 

Each  APE  has  two  input  duty-cycle  decoders,  one  each  for  input  X 
and  input  Y.   The  function  of  these  decoders  is  to  convert  the  machine  number 
in  duty-cycle  modulation  into  a  10-bit  binary  number  and  to  store  the  result. 
The  decoder  consists  of  a  10-bit  up-counter  with  a  gated  input.   A  clock 
signal  is  gated  to  the  input  of  the  10-bit  up-counter  by  the  data  input  signal 
in  duty-cycle  modulation,  as  shown  in  Figure  5.7.   Therefore,  the  data  input 
is  measured  against  the  clock  and  the  data  input  signal  width  is  registered 
in  the  counter  in  terms  of  the  number  of  periods  of  the  clock. 
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Figure  5.7   The  Input  Data  Decode: 
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5.7  The  Stochastic  Processor 

After  the  inputs  X  and  Y  are  decoded  and  stored  in  "binary  numbers, 
they  are  processed  "by  the  stochastic  processors  according  to  the  operation 
code  9  from  the  instruction  decoder.   The  stochastic  processor  consists 
of  two  binary-to-SRPS  converters  and  the  processing  network,  which  have 
already  been  discussed  in  Chapter  3  and  Chapter  h.      They  are  not  repeated 
here. 

5.8  The  SRPS  Integrator  and  the  Output  Encoder 

The  function  of  the  SRPS  integrator  is  to  convert  the  result  of  the 
stochastic  processor  in  SRPS  representation  into  "binary  representation  for 
encoding  into  pulse  width  modulated  signal.   The  integrator  and  the  encoder 
are  implemented  with  an  up-down-counter  as  shown  in  Figure  5.8.   The  delayed 
reset  changes  from  a  logical  '0'  level  to  a  logical  ' 1'  level  at  the  end 
of  —  of  a  computing  cycle,  triggering  the  one-shot  circuit  to  send  out  a 
pulse.   This  pulse  presets  the  counter  according  to  the  preset  input:   In 
the  case  of  addition  and  subtraction,  the  preset  input  is  a  binary  fraction 
of  one  half.   It  is  otherwise  all  zero  for  multiplication  and  division.   For 
storage  the  preset  input  equals  the  number  to  be  stored.   For  any  operation 
other  than  storage,  the  counter  is  set  to  count  up  (right  after  it  is 
preset  by  the  delayed  reset  pulse)  and  continues  to  do  so  until  the  end  of 
the  computing  cycle.   During  the  up-count  period,  it  is  driven  by  the  output 
of  the  stochastic  processor.   At  the  end  of  the  up-count  period  (which  is 
also  the  end  of  the  computing  cycle),  the  content  of  the  up-down-counter 
represents  the  result  of  the  stocahstic  processor  in  binary  representation; 
it  is  ready  to  be  encoded  and  transmitted  at  the  beginning  of  the  next 
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computing  cycle.   For  the  storage  operation  the  up-down-counter  is  preset 
to  the  value  of  the  number  to  "be  stored  "by  the  delayed  reset  pulse.   This 
number  remains  in  the  counter  until  the  end  of  the  computing  cycle. 

At  the  beginning  of  the  next  computing. cycle,  the  counter  is  set 
to  count  down.   At  the  same  time,  the  output  clock,  replacing  the  output  of 
the  stochastic  processor,  is  used  to  drive  the  up-down-counter.   The  count- 
down operation  continues  until  the  counter  is  empty.   It  is  obvious  that 
the  period  over  which  the  up-down-counter  counts  down  corresponds  to  the 
initial  number  in  the  counter,  which  is  the  number  to  be  stored  in  the  case 
of  storage  operation  and  the  estimate  of  the  result  of  the  stochastic  pro- 
cessor in  the  case  of  other  operations.   Furthermore,  the  output  represented 
by  the  length  of  the  count-down  period  is  already  in  the  form  of  duty-cycle 
modulation.   It  can  be  used  immediately  to  modulate  the  output  transmitter. 
5.9      The  Communication  Subsystem  of  the  APE 

The  APE  transmits  and  receives  all  types  of  information  through 
radio  frequency  channels  only.   Each  APE  is  equipped  with  two  tunable 
receivers,  a  clock  receiver  and  a  transmitter  for  communication  with  other 
units  of  the  APE  machine.   All  these  communication  subsystems  must  be  of  the 
micropower  type.   This  precludes  the  use  of  many  commercially  available 
communication  circuits.   They  are  built  in  fact  entirely  with  discrete  compon- 
ents.  In  the  following  sections,  they  are  discussed  in  detail  separately. 
5.9.1    The  Remotely  Tunable  Data  Receiver 

Although  receiver  design  is  thoroughly  understood,  it  remains  a 
very  time-consuming  process  to  design  and  develop  a  good  one  for  a  specific 
application.   Of  the  numerous  considerations  given  to  the  design  of  the 


receiver,  only  the  important  ones  shall  be  discussed  below.   The  basic 
requirements  of  this  receiver  are  as  follows: 

1)  micropower  consumption,  with  a  maximum  drain  equal  or  less  to 
2.5  ma  from  a  6  volt  source, 

2)  remote  tunable  feature  for  receiving  pulse  width  modulated 
RF  from  any  of  the  eleven  channels  covering  a  frequency  band 
of  5.300  MHz, 

3)  a  bandwidth  of  about  10  KHz  to  resolve  a  100  msec  input  pulse, 
k)        a  sensitivity  of  500  microvolts, 

5)    a  selectivity  of  at  least  "60  db  down  "  for  the  nearest  channel 
Various  types  of  receiving  principles  and  techniques  have  been 
investigated  for  the  implementation  of  this  receiver.   It  was  found  that  the 
heterodyne  receiver  is  the  most  suitable  because  of  its  high  sensitivity, 
high  selectivity  and  simplicity  in  tuning.   However,  it  is  also  well  known 
that  image  frequency  response  and  spurious  response  are  the  inherent  short- 
comings of  this  type  of  receiver.   Although  more  sophisticated  types  of 
receiver  design,  such  as  superheterodyne  and  double  conversion  hetero- 
dyne types,  could  overcome  these  shortcomings,  they  would  easily  exceed 
the  limit  on  the  power  consumption.   Happily  enough,  proper  reception  with 
a  heterodyne  receiver  is  still  possible  if  channel  frequencies  are  assigned 
carefully,  so  that  no  channel  frequency  lies  near  a  major  spurious  response 
frequency.   Originally,  it  was  planned  to  use  a  powerful  microwave  source 
at  1296  MHz  to  transmit  the  electric  power  to  the  APEs.   To  avoid  interference, 
the  frequency  of  the  data  channels  are  chosen  to  be  as  far  down  below  the 
1296MHz  as  possible.   On  the  other  hand,  the  range  of  tuning  must  be  wide 
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enough  to  cover  all  channels.   For  a  given  timing  range,  an  increase  in  the 
frequencies  of  the  band  would  decrease  the  tuning  ratio  and  leads  to  simpler 
tuning  circuits.   Therefore  it  is  preferrable  to  use  higher  channel  frequencies 
from  the  tuning  point  of  view.   Other  factors,  such  as  the  spurious  frequencies 
and  the  availability  of  the  components  also  affect  the  frequency  assignment. 
A  good  compromise  solution  is  to  have  the  frequencies  assigned  between  15  MHz 
and  20.3  MHz,  with  the  IF  frequency  at  10.811  MHz  and  with  the  channel  spacing 
being  530  KHz.   Within  the  tuning  range  of  the  readily  available  10.7  MHz  IF 
transformer  for  commercial  FM  receivers,  the  10.811  MHz  is  chosen  to  be  the 
IF  because  it  minimized  the  interference  of  the  spurious  response.   This  can 
be  best  shown  by  examining  the  spurious  response  in  more  detail.   Spurious 
responses  are  generated  when  harmonics  of  the  input  signal  beat  with  the 
local-oscillator  signal  (or  its  harmonics)  to  produce  a  signal  close  to  the 
intermediate  frequency  (or  its  subharmonics) .   To  see  how  these  higher  har- 
monics mix,  one  could  expand  the  transfer  characteristics  of  the  mixer  tran- 
sistor about  the  operating  point  by  means  of  Taylor  series 

Where  I  denotes  the  total  collector  current 
c 

Ifi  denotes  the  collector  bias  current 
V   denotes  the  base  to  emitter  voltage 

v   denotes  the  AC  component  of  the  base  to  emitter  voltage. 
Equation  5.1  can  also  be  written  as 

IC=  X0  +  al  Vbe  +  a2  Vbe2  +  a3  Vbe3  +  '"  5'2 

where  a  is  the  n   order  coefficient  in  Equation  5.1. 


TO 
In  the  ideal  case  of  quadratic  mixing,  i.e.  with  a.  =  0  i  >  2, involving  two 
pure  sinusoids,  no  spurious  response  could  occur.   However,  for  a  practical 
mixer,  the  coefficients  of  higher  degree  are  not  zero,  resulting  in  cross- 
modulation,  modulation  deepening  and  spurious  responses.   For  a  pulse  width 
modulation  scheme  as  in  the  case  of  the  APE  machine,  the  first  two  phenomena 
have  less  effect  on  the  reception  than  the  spurious  responses.   With  the 
above-mentioned  frequency  assignment,  the  image  response  occurs  in  the  hand 
between  36.62  MHz  and  1+1.92  MHz.   These  image  frequencies  are  far  away  from 
any  signal  frequencies,  they  therefore  cause  no  significant  interference. 
The  same  is  true  for  the  first  order  subharmonic  response.   In  this  case,  the 
response  is  caused  by  the  higher  even  degree  terms,  such  as  the  1+th,  6th,  ... 
etc.,  as  these  terms  cause  the  harmonics  of  the  beat  frequencies  to  appear. 
Let  the  wanted  input  signal  be  f  ;  the  spurious  response,  f  ;  the  IF  frequency, 

f  ;  and  the  local  frequency,  f  ,  with 
i  e 

f  =  f  +  f.  5.3 

e    w    i  ^ 

The  first  order  subharmonic  spurious  response  is  given  by 

f . 

f  =  f  +  7T  5> 

s    w   2  ' 

f . 

When  this  f  beats  with  f  ,  the  result  is  ~,    the  first  order  subharmonic 
s  e  d  ■ 

of  the  f  .   According  to  Equation  5.1+,  this  type  of  spurious  response  for 
fw  =  15.00  MHz  is  20.1+05  MHz.   The  frequencies  of  this  type  of  spurious 
response  for  higher  frequency  channels  are  of  course  higher  in  frequency. 
Therefore,  they  all  lie  outside  the  active  band  of  frequencies  and  cuase  no 
significant  interference! 

For  higher  order  but  less  significant  spurious  response,  it  is  not 
possible  to  eliminate  all  unwanted  signals  completely  from  the  active  band. 
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However,  a  properly  chosen  IF  frequency  places  the  next  most  significant 
spurious  response  as  far  away  from  data  channels  as  possible.   The  next  most 
significant  spurious  response  is  the  second  order  image  response.   It  is 
caused  by  the  second  harmonics  of  the  unwanted  signal  heating  with  the  local 
frequency  to  produce  fjF  .   For  example,  let  f  =15.00  MHz,  f   =  10.811  MHz, 
the  local  frequency  is  therefore  f  =  25.811  MHz.   Wow  if  an  unwanted 
signal  exists  at  frequency  f  =  18.311  MHz,  its  second  harmonic  beating 
the  local  frequency  will  produce  exactly  the  intermediate  frequency.   If 
f  =  18.311  MHz  happens  to  be  another  data  channel  frequency  transmitting  data 
at  much  higher  power  level,  harmful  interference  could  occur.   The  intermediate 
frequency  obviously  affect  the  location  of  these  spurious  response.   The 
optimum  IF  frequency  can  affect  the  location  of  these  spurious  responses. 
The  optimum  IF  frequency  can  be  shown  to  be  10.811  MHz.   Figure  5.9  shows 
different  type  of  spurious  response. 

The  electronic  tuning  of  the  receiver  is  done  by  means  of  a 
reverse-biased  diode.   It  is  well  known  that  the  junction  capacitance  of  a 
reverse  biased  diode  is  given  by 

C-X, 

where  C   is  the  capacitance  at  zero  bias 
0 

V  is  the  reverse-biased  voltage 

4>  is  the  contact  potential 

£  is  a  constant  determined  by  the  impurity  gradient  of  the 
diode  (for  an  abrupt  junction  £  =  — ) 
The  tuning  voltage  ranges  from  1  volt  to  6  volts  is  supplied  from  the  decoder 
for  function  as  described  before.   The  tuning  circuit  and  the  other  part  of 
the  receiver  are  shown  in  Figure  5.10. 
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5.9.2  The  U2.5  MHz  Clock  Receiver 

This  receiver  consist  of  a  2-stage  tuned  RF  amplifier  followed 
by  an  AM  detecting  stage.   The  total  power  consumption  is  3  ma  from  a 
6-volt  source.   The  sensitivity  is  200  microvolts, and  its  bandwidth  is  about 
1  MHz.   The  circuit  diagram  of  this  receiver  is  given  in  Figure  5.11. 

5.9.3  The  Switching  Transmitter 

The  transmitter  is  shown  in  Figure  5.12.   It  is  a  crystal-controlled 
low  power  switching  transmitter  developed  in  the  Computer  Laboratory  to 
satisfy  the  particular  requirements  of  the  APE  machine.   It  is  a  tune-gate  and 
tune-drain  oscillator.   The  second  gate  of  the  dual-gate  FET  is  used  to 
switch  the  gain  to  turn  on  or  turn  off  the  oscillator.   The  transistor 
2N2U75  is  used  to  reduce  the  standby  power  consumption  while  the  2N289^A  is 
used  for  cutting  down  the  Q  value  of  the  tank  circuit  when  the  transmitter 
is  turned  off,  i.e.  in  order  to  produce  a  RF  pulse  with  a  sharp  decay.   The 
total  power  consumption  at  15%  duty  cycle  is  about  6  mW. 
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6.   THE  APE  CONTROL  UNIT 
6.1      Block  Diagram  of  the  APE  Control  Unit 

The  "block  diagram  of  the  APE  control  unit  is  shown  in  Figure  6.1.   It 
can  "be  divided  into  three  major  functional  units.   The  first  one  is  the  program 
instruction  encoder  and  transmitter.   The  program  instructions  are  put  together 
by  the  settings  of  the  operation  selector  and  of  the  channel  selectors  for  any 
type  of  operation,  except  storage.   In  the  case  of  storage,  the  tuning  informa- 
tion is  not  needed.   Instead,  the  number  to  be  stored  has  to  be  incorporated 
into  the  instruction.   This  number  is  chosen  in  signed  decimal  representation 
by  the  thumbwheel  switch  and  converted  into  a  10-bit  machine  number  in  binary 
representation.   The  multiplexer,  controlled  by  the  operation  selector,  then 
puts  this  10-bit  binary  number  into  the  instruction  code  generator,  instead 
of  the  information  from  the  channel  selector  as  in  the  case  of  other  operations. 
The  instruction  code  generator  is  used  to  encode  the  instruction  into  a 
sequence  of  pulse-width-modulated  pulses.   This  sequence  of  pulses  is  then 
used  to  modulate  the  carrier  frequency  of  the  channel  over  which  the  instruction 
is  to  be  transmitted.   The  second  functional  unit  is  the  clock  generation 
and  transmission  unit.   It  consists  of  a  composite  clock  generator  driven 
by  a  master  oscillator  and  a  U2.5  MHz  transmitter.   The  third  functional  unit 
is  the  output  monitoring,  decoding  and  display  unit.   It  consists  of  an 
all-channel  receiver,  a  machine-number-to-BCD  converter,  a  scaler,  a  Nixie 
display  unit,  an  "alive  indicator",  and  finally  an  ASCII  encoder  for  interfac- 
ing with  a  teletype.   The  output  of  any  APE  channel  can  be  monitored  by  the 
all-channel  receiver.   If  the  APE  machine  is  in  the  test  mode,  the  signal 

receiver  is  used  to  drive  the  alive  indicator.   In  the  execution  mode,  the 
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received  signal  is  the  output  data  from  the  APE  of  that  channel.   This  data 
is  in  machine  numbers  and  is  converted  into  signed  BCD  codes  for  decimal 
display.   The  1997/1023  scaler  is  to  scale  a  10-bit  binary  machine  number  into 
three  decimal  digits  plus  sign.   In  the  following  sections,  detailed  discus- 
sion will  be  given  to  the  non-trivial  functional  blocks. 
6.2      The  Instruction  Code  Generator 

It  consists  of  a  pulse  sequence  generator,  a  counter,  a  multiplexer, 
and  a  monostable  multivibrator  as  shown  in  Figure  6.2.   The  pulse  sequence 
generator  produces  a  sequence  of  32  pulses  when  a  starting  signal  is  received. 
The  counter  and  the  multiplexer  are  connected  to  form  a  parallel-to-serial 
converter.   The  instruction  word  is  fed  to  the  data  inputs  of  the  multiplexer. 
As  the  counter  starts  to  count,  the  inputs  to  the  multiplexer  are  gated 
sequentially  to  the  output.   The  output  of  the  multiplexer  is  used  to  modulate 
the  timing  network  of  a  monostable  multivibrator  by  switching  an  auxiliary 
timing  resistor  in  and  out  of  the  timing  network.   The  output  from  the  mono- 
stable  multivibrator  produces  therefore  a  binary-pulse-width-modulated  signal. 
Figure  6.3  shows  the  actual  circuit  of  the  instruction  code  generator.   It 
should  be  pointed  out  that  the  instruction  code  generator  is  designed  to 
handled  instructions  up  to  32  bits  long.   (For  the  APE  machine,  the  instruction 
is  only  28  bits  long! ) 

Figure  6.3  shows  the  actual  circuit  of  the  instruction  code  generator. 
Two  SN7Ul50N's  are  employed  to  form  the  32-bit  multiplexer.   The  diode  D 
is  used  to  switch  the  k.fK   auxiliary  timing  resistor  R   in  and  out  of  the 
network.   The  ready  indicator  circuit  is  to  turn  on  an  indicator  light  when 
the  instruction  code  generator  has  completed  sending  out  a  full  instruction 
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pigure  6.2   A  Block  Diagram  of  the  Instruction  Code  Generator 
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word  and  is  ready  for  loading  for  another  word  in  another  APE  channel. 
Figure  6.U  shows  an  instruction  word  and  the  corresponding  pulse-width- 
modulated  signal. 
6.3      The  9-Channel  Instruction  Transmitter 

This  transmitter  consists  of  nine  separate  crystal-controlled  RF 
oscillators  and  modulators.   Whenever  an  instruction  is  to  be  transmitted 
over  a  specific  channel,  the  power  to  the  corresponding  RF  oscillator  and 
modulator  is  switched  on.   Otherwise,  their  power  supplies  are  switched  off. 
This  simplifies  a  great  deal  of  the  RF  shielding  problem  because  the  RF 
signals  of  the  instruction  transmitter  are  generated  only  when  they  are 
needed  during  the  programming  mode.   The  schematics  of  the  transmitter  for 
each  channel  is  given  in  Figure  6.5:   It  is  essentially  a  gated  RF  source. 
The  crystal-controlled  oscillator  generates  the  carrier  which  is  sent 
through  a  tuned  amplifier  to  remove  the  harmonics  and  is  amplified  to  about 
one  volt  in  amplitude.   Then  it  is  fed  to  the  first  gate  of  a  dual  gate  FET. 
The  second  gate  of  the  FET  is  switched  between  +8.5  volts  and  -5  volts.   When 
it  is  held  at  +8.5  volts,  the  dual  FET  is  operated  in  the  active  region.   The 
RF  signal  will  be  further  amplified  before  it  is  transmitted.   When  the 
second  gate  is  at  -5  volts,  the  FET  is  switched  off,  allowing  only  very 
small  amounts  of  RF  to  leak  through.   The  reason  for  using  the  dual  gate  FET 
is  its  extremely  small  gate-to-drain  capacitance  which  leads  to  a  better 
on-off  ratio  in  switching  the  RF  signals  than  would  be  possible  with  its  bipolar 
equivalent.   The  switching  voltage  is  provided  by  the  switching  circuit  of 
the  transistors  2N1308  and  2N1309.   The  diode  in  the  emitter  of  the  2N1308  is 
used  for  improving  the  noise  margin  of  the  input. 
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6.k  The  Clock  Signal  Transmitter 

The  U2.5  MHz  clock  transmitter  has  the  same  schematics  as  the 
instruction  transmitter  "but  is  operated  at  higher  frequency.   However,  the 
clock  transmitter  operates  during  the  execution  mode.   In  order  to  avoid 
interference  with  other  circuits,  it  is  physically  built  in  a  shielded  box. 
The  operation  principles  of  the  clock  signal  transmitter  are  identical  to 
those  of  the  instruction  transmitter  discussed  in  Section  6.3.   It  is  not 
repeated  here. 
6. 5      The  BCD/Machine  Number  Converter 

The  inputs  to  this  converter  are  three  decades  of  BCD  code  together 
with  a  sign  bit,  all  from  the  thumbwheel  switch  in  the  front  panel.   The 
output  is  a  10-bit  machine  number.   The  conversion  is  accomplished  in  two 
steps.   The   first  step  is  to  convert  the  signed  BCD  number  into  a  signed 
binary  number.   The  second  step  is  to  transform,  according  to  Equation  3.8,  the 
signed  binary  number  into  a  machine  number.   The  first  step  is  implemented 
with  six  U-bit  BCD/binary  converters  SN7^l8U,  connected  as  shown  in  Figure  6.6, 
The  transformation  in  the  second  step  corresponds  to  inverting  the  binary 
number  after  shifting  it  one  bit  position  to  the  right.   The  inversion  is  done 
by  nine  EXCLUSIVE-OR  gates  acting  as  TRUE/ COMPLEMENT  gates.   Shifting  is 
done  simply  by  wiring  the  output  binary  bits  into  the  shift  position.   Note 
that  in  the  machine  number,  a  negative  number  can  be  obtained  by  inverting 
a  positive  number  with  the  same  magnitude.   Therefore,  if  the  sign  bit  is  used 
to  control  the  TRUE/ COMPLEMENT  gates,  as  shown  in  the  figure,  the  result 
will  be  the  machine  number  equivalent  to  the  3-digit  signed  decimal  number 
of  the  inputs. 
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6.6  The  All-Channel  Receiver 

The  receiver  is  used  to  receive  the  output  data  signal  as  well  as  the 
test  reply  signal  from  the  APEs.   It  is  a  heterodyne  type  AM  receiver,  as 
shown  in  Figure  6.7.   Its  operation  is  similar  to  the  one  used  for  receiving 
input  data  in  the  APE,  except  that  this  all-channel  receiver  has  an  extra  IF 
stage:   Its  sensitivity  is  then  about  30  microvolts.   It  is  housed  in  a 
shielded  "box  to  reduce  the  interference  from  the  nosiy  environment  of  the 
TTL  logic  networks.   The  tuning  is  done  with  twelve  pairs  of  potentiometers. 
Each  pair  of  potentiometers  is  adjusted  to  produce  a  pair  of  voltages  for 
tuning  the  RF  input  resonant  circuit  and  the  local  oscillator  resonant  cir- 
cuit for  the  reception  of  a  particular  channel.   A  particular  pair  of  voltages 
is  switched  into  the  tuning  circuits  whenever  the  particular  channel  is 
to  "be  tuned  in. 

6.7  The  Machine  Number  to  BCD  Converter  and  the  1997/1023  Scaler 

After  the  input  data  to  the  APEs  are  processed,  the  results  are  sent 
out  "by  the  APEs  in  machine  number  representation.   For  easy  read-out  of  the 
results,  it  is  necessary  to  have  them  converted  back  to  signed  decimal  repre- 
sentation.  This  is  done  in  the  control  unit  by  the  1997/1023  scaler  together 
with  the  machine-number-to-BCD  converter. 

The  output  signals  from  the  APE  are  in  pulse-width  modulation.   Th3 
pulse  width  ranges  from  0  to  a  maximum  of  1023  clock  periods.   The  1997/1023 
scaler  is  to  linearly  map  a  number  in  this  range  into  a  number  ranging  from 
0  to  1997  (to  the  limit  of  the  truncated  least  significant  digit  ...)  Figure  6.8 
shows  the  block  diagram  of  the  scaler.   What  it  does  is  simply  to  delete  one 
pulse  for  every  forty  input  pulses.   A  fast  clock  having  a  frequency  twice  as 
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high  as  the  system  clock  is  used  in  the  input  of  the  scaler  for  comparison  with 
the  pulse-width  modulated  signal  from  the  output  of  the  APEs .   With  such  an 
arrangement,  the  number  of  the  output  pulses  is  that  of  the  input,  scaled  by 
a  factor  1997/1023. 

After  the  scaling,  an  inverse  transform  of  Equation  3.8  is  performed 
by  the  machine-number-to-BCD  converter  as  shown  in  Figure  6.9.   The  decade 
up-down  counters  are  preset  to  999  at  the  beginning  of  a  new  computing  cycle, 
just  before  the  counter  starts  to  count  down.   When  the  counter  reaches  the  all 
zero  state,  it  is  switched  to  count  up.   How  long  the  counting  process  lasts 
depends  on  the  output  signal  from  the  APE  channel  to  be  displayed.   If  the 
counting  process  ends  during  the  count-down  period,  the  sign  of  the  result  is 
positive.   Otherwise  it  is  negative.   In  both  cases,  the  magnitude  of  the 
result  is  given  by  the  value  of  the  final  count.   The  full  range  of  the  input 
to  this  converter  is  1997  pulses.   This  corresponds  to  an  output  range  from 
-999  to  +998.   Therefore,  the  combined  function  of  the  scaler  and  the  con- 
verter will  transform  a  10-bit  number  into  a  3-digit  signed  decimal  number  to 
within  the  accuracy  of  the  10-bit  number. 
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7.   THE  APE  SENSORS  AND  THE  REMOTE  POWER  SUPPLY  OF  THE  APEs 
7.1      The  APE  Sensor 

For  input  data  acquisition,  two  APE  sensors  have  "been  implemented  for 
the  APE  machine.   The  convert  a  light  intensity  into  a  pulse  width  modulated 
signal  with  the  leading  edge  of  the  pulse  synchronized  with  a  reference 
clock.   A  sensor  consists  of  a  clock  receiver,  a  switching  transmitter,  a 
synchronizing  signal  separator,  a  transmitter  timing  control,  and  a  light- 
intensity-to-pulse-width  converter,  as  shown  in  Figure  7.1.   As  in  the  case  of 
the  APEs,  the  output  of  the  clock  receiver  is  the  composite  clock  signal. 
The  synchronizing  signal  is  separated  from  the  composite  clock  signal  by 
the  separator.   This  synchronizing  signal  is  used  to  trigger  a  one-shot 
circuit  in  the  converter,  whose  timing  network  consists  of  a  photocell  and 
a  capacitor.   The  width  of  the  output  pulse  from  the  one-shot  therefore 
depends  on  the  light  intensity  which  the  photocell  senses.   The  leading  edge 
of  the  output  pulse  from  the  one-shot  is  in  synchronism  with  the  triggering 
signal,  which  is  the  synchronizing  signal.   The  output  of  the  one-shot, 
together  with  the  clock  signal  and  synchronizing  signal,  is  sent  to  the 
transmitter  timing  control  circuit,  whose  function  is  to  turn  the  transmitter 
on  and  off  at  appropriate  instants  of  time.   All  functional  blocks  in 
Figure  7.1  except  the  light  intensity  converter  are  identical  to  the  cor- 
responding parts  in  the  APEs:   Their  details  have  been  given  in  Chpater  6. 
The  light  intensity  converter  is  shown  in  Figure  7.2.   A  COS/MOS  AND  gate  and 
an  inverter  is  connected  to  form  the  one-shot  as  shown.   The  output  pulse 
width  is  depending  on  the  product  of  the  photoresistance  and  the  capacitance. 
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The  former  is  dependent  on  the  input  light  intensity.   Consequently,  the 
output  pulse  width  is  determined  by  the  input  light  intensity. 
7.2      The  Remote  Power  Supply  for  the  APE 

The  APE  machine  is  equipped  with  a  remote  power  supply  for  the  APEs. 
The  power  is  sent  to  the  APE  remotely  to  free  it  completely  from  any  physical 
connection  with  other  parts  of  the  APE  machine,  therefore  further  enchancing 
the  structural  flexibility  of  the  machine.   The  general  requirements  of  the 
remote  power  source  are:  l)  The  available  power  to  the  APEs  placed  in  the 
active  region  of  the  power  source  should  be  about  100  mW.   2)   The  operation 
of  the  remote  power  supply  must  not  interfere  with  the  operations  of  the  APE. 

This  part  of  the  APE  machine  has  been  investigated  originally  by 
another  graduate  student  in  the  Computer  Hardware  and  Systems  Research  Group. 
The  possibility  of  sending  power  to  the  APEs  over  a  microwave  frequency 
channel  was  first  considered.   A  detail  report  on  this  study  is  given  in 
reference  (21).   An  experimental  set-up  having  a  6 0 -watt  transmitter  at 
1296  MHz  with  a  helical  transmitting  antenna  was  built.   It  was  found  that 
almost  sufficient  power  for  operating  an  APE  could  be  delivered  to  an  APE 
which  is  equipped  with  a  quarter  wave  slotted  line  antenna  for  the  reception 
of  power  and  is  placed  a  few  feet  from  the  transmitting  antenna.   However, 
it  was  also  found  that  the  transmission  of  the  microwave  power  interferes 
with  the  operations  of  the  APEs.   As  described  in  Chapter  5,  each  APE  is 
equipped  with  three  receivers  operating  at  frequencies  between  15,000  MHz 
and  U2.5  MHz.   The  transistors  employed  in  those  receivers  have  necessarily 
a  high  gain-bandwidth  product.   Unfortunately,  these  transistors  also 
response  to  RF  signals  at  1296  MHz.   Without  elaborate  shielding  and 


96 

feed-through  filtering,  the  leakage  of  the  1296  MHz  RF  signal  into  the  box 
housing  the  APE  circuits  is  sufficient  to  upset  the  operations  of  the. 
receivers.   Some  experiments  have  been  conducted  to  house  the  APE  circuitry 
inside  an  elect romagnetically  shielded  box,  with  1296  MHz  resonant  traps 
guarding  all  feed-through  terminals  feeding  data  signals  into  the  APE  cir- 
cuits.  With  careful  tuning  of  the  1296  MHz  resonant  traps,  the  interference 
could  be  reduced  to  an  acceptable  level.   However,  such  tuning  is  quite 
critical  and  could  be  detuned  easily  by  coupling  with  nearby  objects.   Hence, 
this  approach  is  not  employed  for  remotely  powering  the  APEs. 

The  more  suitable  solution  to  the  remote  powering  of  the  APE  is 
by  means  of  solar  cells.   This  approach  eliminates  the  harmful  RF  interference. 
These  solar  cells  are  constructed  with  a  n-type  silicon  base  material.   A 
very  thin  layer  of  p-type  material  is  formed  on  the  n-type  base  through  dif- 
fusion to  produce  a  p-n  junction  with  large  area.   When  this  p-n  junction  is 
short-circuited  in  darkness,  no  steady  current  will  flow  in  the  external  cir- 
cuit inspite  of  the  existance  of  the  contact  potential  of  the  p-n  junction 
as  expected  on  thermodynamic  grounds.   However,  if  light  in  a  suitable  range 
of  wavelength  is  allowed  to  fall  on  the  p-n  junction,  a  voltage  will  develop 
across  the  external  circuit  and  current  starts  to  flow  with  a  terminal  char- 
acteristic as  shown  in  Figure  7.3.   More  details  about  such  a  photovoltaic  effect 
can  be  found  in  many  good  references  (22)  (23).   Figure  7.U  shows  the  spectral 
response  of  the  solar  cells.   For  powering  the  APE,  four  solar  modules, 
designated  5SM1020GE10PL  by  International  Rectifier,  are  packed  together 
on  the  top  surface  of  the  APE  to  absorb  power  from  an  array  of  incandescent 
lamps. 
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Figure  7.1+   Spectral  Response  of  Photocells 
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8.   CONCLUSION  AND  OUTLOOK 

With  the  help  of  the  latest  development  in  COS/MOS  integrated  circuits 
and  high  gain-bandwidth-product  transistors,  the  APE  machine  has  been  success- 
fully implemented.   The  maximum  power  consumption  of  an  APE  is  actually  about 
TOmW.   The  APE  machine,  as  described  in  this  thesis,  has  been  operating  satis- 
factorily.  It  is  the  world's  first  computer  with  variable  topology  of  its  kind. 
It  is  hoped  that  it  will  open  up  a  new  dimension  in  computer  system  and  circuit 
design  for  highly  flexible  and  reliable  computers,  with  such  advanced  features 
as  variable  topology,  incrementable  computing  power,  readily  mass-producible 
structure  as  well  as  self-checking  and  self-repairing  capabilities. 

In  regard  to  the  fault-tolerant  features  of  the  APE  machine,  only 
static  checking  procedures,  such  as  the  alive  testing,  have  been  implemented. 
Dynamic  testing  could  easily  be  implemented  by  employing  a  highly  reliable  APE 
as  the  checker  inside  the  control  unit  and  comparing  the  output  of  the  channel 
to  be  checked  and  that  of  the  checker  with  identical  inputs  and  operations. 
Furthermore,  these  checking  procedures  can  be  done  automatically  rather  easily 
with  the  type  of  structure  found  in  the  APE  machine:   Whenever  a  failure  is 
discovered  in  a  specific  channel,  the  element  could  be  replaced  automatically. 
If  there  are  no  more  spare  channels  to  replace  the  defective  ones,  the  control 
unit  could  be  readily  programmed  either  to  notify  the  operator  or,  in  some 
inaccessible  circumstances,  to  reconfigure  the  topology  of  the  set  of  APEs  such 
that  the  required  number  of  APE  channels  is  reduced  to  the  available  number, 
with  some  lower-priority  functions  of  the  program  being  cut  out.   To  match  the 
reliability  of  the  other  parts  of  the  computer,  information  transmission  of  the 
APE  machine  should,  of  course,  employ  error  checking  and  error  correcting  codes. 
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The  APE  machine  described  in  this  thesis  made  use  of  RF  linkages  to 
transmit  data.   There  is  obviously  a  limit  to  the  number  of  APE  channels  that 
could  be  placed  in  a  given  frequency  band.   This  limitation  could  be  overcome 
with  some  sacrifice  of  the  flexibility  of  the  APE  machine.   For  example,  if 
information  transmission  is  carried  out  over  a  confined  space,  such  as  a  com- 
mon bus,  instead  of  the  free  space,  then  the  number  of  APE  channels  could  be 
increased  by  using  several  different  confined  spaces.   In  this  case,  special 
consideration  must  of  course  be  given  to  the  communication  between  the  units 
associated  with  different  confined  space. 

As  an  alternative  approach  to  the  design  of  the  communication  subsystem 
for  the  APE  machine,  a  time-multiplexing  scheme  can  be  used  instead  of  the 
frequency  division  scheme  employed  in  the  APE  machine.   In  the  time-multiplex- 
ing approach,  the  data  receivers  of  all  APEs  are  operated  over  the  same  fixed 
frequency  channel  having  a  much  wider  channel  width.   Then  each  type  of  APE 
is  given  a  specific  portion  of  the  communication  period  to  transmit  the  output 
data.   To  receive  data  from  a  specific  type  of  APE.   The  data  input  register 
is  programmed  to  take  in  data  only  during  the  specific  portion  of  the  communica- 
tion period  reserved  for  that  type  of  APE.   Such  an  approach  eliminates  the 
need  of  tuning  the  data  input  receivers  and  requires  only  one  data  receiver 
instead  of  two.   On  the  other  hand,  additional  logic  circuits  are  required 
in  this  approach.   Furthermore,  it  reduces  the  degree  of  homogeneity  of  circuits 
between  different  types  of  APE. 

As  described  in  Chapter  3,  the  fluctuation  of  the  result  obtained 
by  stochastic  processing  would  increase  with  the  increase  of  the  number  of  cas- 
caded stages.   This  would  limit  the  number  of  stages,  and  ultimately  the  number 


1Q0 
of  APEs,  that  could  be  used.   This  difficulty  could  not  be  overcome  with 
stochastic  computation  using  truly  random  SRPSs.  Nevertheless,  this  fluctuation 
problem  could  be  completely  eliminated  with  special  pseudo  random  SRPS  whose 
period  equals  precisely  the  sampling  integration  period  for  the  estimation  of 
the  mean  value:   Because  the  integration  is  taken  over  the  entire  period,  it 
follows  from  the  periodicity  of  the  SRPS  that  the  same  result  is  obtained  for 
every  integration  period.   More  importantly,  the  result  thus  obtained  can  be 
shown  to  be  the  mean  value  of  the  SRPS.   In  other  words,  the  correct  result 
is  obtained  every  time  without  fluctuation. 
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